ivaw Sequence Listing Error Nummary 



ERROR DETECTED SUGGESTED CORRECTION 



SERIAL NUMBER; 



: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA- HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 



Wrapped Nudeics 



Wrapped Aminos 



The number/text at the end of each line "wrapped" down to the next line. 
This may occur if your file was retrieved in a word processor after creating iL 
Please adjust your right margin to .3, as this will prevent "wrapping". 

The amino add number/text at the end of each line Vrapped " down to the next line. 
This may occur if your file was retrieved in a word processor after creating it. 
Please adjust your right margin to .3, as this v^nll prevent "wrapping". 



Incorrect Line Length The rules require that a line not exceed 72 characters in length. This Indudes spaces. 

Misaligned Amino Add The numbering under each 5th amino add is misaligned. This may be caused by the use of tabs 

Numbering between the numbering. It is recommended to delete any tabs and use spadng between the numbers. 

Non-ASCII 



Variable Length 



Patentin ver. 2.0 "bug" 



Skipped Sequences 
(OLD RULES) 



Skipped Sequences 
(NEW RULES) 



This file was not saved in ASCII (DOS) text, as required by the Sequence Rules. 
Please ensure your subsequent submission is saved in ASCII text so that it can be processed. 

Sequence(s) contain n's or Xaa's which represented more than one residue. 

As per the rules, each n or Xaa can only represent a single residue. 

Please present the maximum number of each residue having variable length and 

Indicate in the (ix) feature section that some may be missing, 

A "bug" in Patentin version 2.0 has caused the <220>-<223> section to be missing from amino add 
sequence(s) . Normally, Patentin would automatically generate this section from the 

previously coded nudeic add sequence. Please manually copy the relevant <220>-<223> section 

to the subsequent amino add sequence. 

Sequence(s) missing. If intentional, please use the following format for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: 

(I) SEQUENCE CHARACTERISTICS:(Do not insert any headings under "SEQUENCE CHARACTERISTICS") 
(xl) SEQUENCE DESCRIPTIONrSEQ ID NO:X: 
This sequence Is Intentionally skipped 

Please also adjust the "(ili) NUMBER OF SEQUENCES:" response to Indude the skipped sequence(s). 

Sequence(s) missing. If intentional, please use the following format for each skipped sequence. 

<210> sequence Id number 
<400> sequence Id number 
000 



Use of n's or Xaa's 
(NEW RULES) 



Use of n's and/or Xaa's have been detected in the Sequence Listing, 
Use of <220> to <223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 



Use of <213>Organism 
(NEW RULES) 

Use of <220>Feature 
(NEW RULES) 



Patentin ver. 2.0 "bug" 



Sequence(s}_ 



are missing this mandatory field or its response. 



Sequence(s) are missing the <220>Feature and assodated headings. 

Use of <220> to <223> is MANDATORY if <213>ORGANISM is "Artifidar or "Unknown" 
Please explain source of genetic material In <220> to <223> section. 

(See "Federal Register," 6/01/98. Vol. 63. No. 104. pp. 29631-32) (Sec. 1.823 of new Rules) 

Please do not use "Copy to Disk" function of Patentin version 2.0. This causes a conupted 
file, resulting in missing mandatory numeric identiriers and responses (as indicated on raw sequence listing). 
Instead, please use "File Manager" or any other means to copy file to floppy disk. 
AKS-Biotechnology Systems Branch- 5/15/99 
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PAGE: 1 RAW SEQUENCE LISTING DATE: 09/03/1999 

PATENT APPLICATION US/09/258,031A TIME: 14:51:33 

INPUT SET: S33211.raw 



This Raw Listing contains the General 
Information Section and those Sequences 
containing ERRORS. r> A) 



1 SEQUENCE LISTING 

3 (1) General Information: 
4 

5 (i) APPLICANT: STUIVER, Maarten Hendrik 

6 CUSTERS, Jerome Humbertina Henricus Victor 

7 SELA-BURIiAGE, Marianne Beatrix 

8 MELCHERS, Leo Sjoerd 

9 VAN DEVENTER-TROOST, Johanna Pieternella 

10 LAGEWEG, Wessel 

11 PONSTEIN, Anne Silene 

12 LAGEWEG, Wessel 

13 PONSTEIN, Anne Silene 
14 

15 (ii) TITLE OF INVENTION: ANTIFUNGAL PROTEINS, DNA CODING THEREFOR, 

16 AND HOSTS INCORPORATING SAME. 

18 (iii) NUMBER OF SEQUENCES: 75 |i^S*V"4 ^(IrJi r^>|tLV>V — jX/^ yO^uX^J^-U^^ 

2 0 (iv) CORRESPONDENCE ADDRESS : Ift A) ADDRESSEE: LADAS & PARRY A /) 

21 (B) street;^ 26 WEST 61 STREET UA/OSf(^ 

-> 22 (C) city; ^ NEW YORK 

23 {D)r^57irr£f ^ . 

■> 24 

-> 25 (F)jjrLUUl!3 - 7604 

2 6 

27 (v) COMPUTER READABLE FORM:^(A) MEDIUM TYPE: 3.25" Floppy disk 

28 L I e I COMPUTER: IBM PC compatible 

29 \^'^u)'V\^y^f^f^ (C) OPERATING SYSTEM: WINDOWS 95 

3 0 * (D) SOFTWARE: WORDPERFECT 8 

31 

32 (vi) CURRENT APPLICATION DATA: 

33 (A) APPLICATION NUMBER: 09/258,031 

34 (B) FILING DATE: 25-FEB-1999 

35 (C) CLASSIFICATION: 435 
36 

3 7 (vii) PRIOR APPLICATION DATA: 

38 (A) APPLICATION NUMBER: PCT/EP97/04923 

39 (B) FILING DATE: 04-SEP-1997^ 
40 

41 (vii) PRIOR APPLICATION DATA: 

42 (A) APPLICATION NUMBER: EP97200831.2 

43 (B) FILING DATE: 19-MAR-1997 
44 

45 (vii) PRIOR APPLICATION DATA: 



PAGE: 2 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/258,031A 



DATE: 09/03/1999 
TIME: 14:51:33 



INPUT SET: S33211.raw 

46 .(A) APPLICATION NUMBER: EP96202466.7 

47 (B) FILING DATE: 04-SEP-1996 
48 



ERRORED SEQUENCES FOLLOW: 



150 (2) INFORMATION FOR SEQ ID NO : 5: 
151 

152 (i) SEQUENCE CHARACTERISTICS: 

--> 153 (A) LENGTH: 354 base pairs 

154 (B) TYPE: nucleic acid 

155 (C) STRANDEDNESS: double / y 

156 (D) TOPOLOGY: linear / /U^ J 

158 (ii) MOLECULE TYPE: DNA (genomic) f 

159 \ 

160 (iii) HYPOTHETICAL: NO 
161 

162 (iii) ANTI-SENSE: NO 

163 

164 (vi) ORIGINAL SOURCE: 

165 (A) ORGANISM: Helianthus annuus 

166 (B) STRAIN: cv . zebulon 
167 

168 (ix) FEATURE: 

169 (A) NAME/KEY: CDS 

170 (B) LOCATION: 1..354 
171 

172 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5: 

173 

174 

175 

176 GAT CCG TCT TTC CCG ATT ACT GGG GAG GTT TAC ACT CCC GGA AAC TCA 48 

177 Asp Pro Ser Phe Pro lie Thr Gly Glu Val Tyr Thr Pro Gly Asn Ser 

178 15 10 15 
179 

180 TCT TTT CCT ACC GTC TTG CAA AAC TAC ATC CGA AAC CTT CGG TTC AAT 96 

181 Ser Phe Pro Thr Val Leu Gin Asn Tyr lie Arg Asn Leu Arg Phe Asn 

182 20 25 30 
183 

184 GAA ACT ACC ACA CCA AAA CCC TTT TTA ATC ATC ACA GCC GAA CAT GTT 144 

185 . Glu Thr Thr Thr Pro Lys Pro Phe Leu lie lie Thr Ala Glu His Val 

186 35 40 45 
187 

188 TCC CAC ATT CAG GCA GCT GTG GTT TGT GGC AAA CAA AAC CGG TTG CTA 192 

189 Ser His lie Gin Ala Ala Val Val Cys Gly Lys Gin Asn Arg Leu Leu 

190 50 55 60 
191 

192 CTG AAA ACC AGA AGC GGT GGT CAT GAT TAT GAA GGT CTT TCC TAC CTT 240 

193 Leu Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly Leu Ser Tyr Leu 



m 



PAGE: 3 IlAW SEQUENCE LISTING DATE: 09/03/1999 

PATENT APPLICATION US/09/258,031A TIME: 14:51:34 

INPUT SET: S33211.raw 
194 65 70 75 80 

195 

196 ACA AAC ACA AAC CAA CCC TTC TTC ATT GTG GAG ATG TTC AAT TTA AGG 2 88 

197 Thr Asn Thr Asn Gin Pro Phe Phe lie Val Asp Met Phe Asn Leu Arg 

198 85 ^ 90 95 



2 00 TCC ATA AAC GTA GAT ATC GAA CAA GAA ACC GCA TGG GTC CAA GCC GGC ^^f^^^ 

201 336 . I — ^ Op^^ 

202 Ser lie Asn Val Asp lie Glu Gin Glu Thr Ala Trp Val Gin Ala Gly ^ ^^JJ- 

Ala Thr Leu Gly Glu Val Z^M^ 



203 100 105 110 

204 

205 GCC ACC CTC GGA GAA GTT 354 

206 

207 115 
208 
209 




412 (2) INFORMATION FOR SEQ ID NO: 15: 
413 

414 (i) SEQUENCE CHARACTERISTICS: 

415 (A) LENGTH: 1784 base pairs 

416 (B) TYPE: nucleic acid 

417 (C) STRANDEDNESS : double 

418 (D) TOPOLOGY: linear 
419 

42 0 (ii) MOLECULE TYPE: cDNA to mRNA 
421 

422 (iii) HYPOTHETICAL: NO 

423 

424 (iii) ANTI-SENSE: NO 

425 

426 (vi) ORIGINAL SOURCE: 

427 (A) ORGANISM: Helianthus annuus 

428 (B) STRAIN: Zebulon 
429 

43 0 (ix) FEATURE: 

431 (A) NAME/KEY: CDS 

432 (B) LOCATION: 21.. 1608 
433 

434 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

435 

436 ATATCACATC TTCTTTCAAC ATG CAA ACT TCC ATT CTT ACT CTC CTT CTT 50 
43 7 Met Gin Thr Ser lie Leu Thr Leu Leu Leu 

438 1 5 10 

439 

440 CTC TTG CTC TCA ACC CAA TCT TCT GCA ACT TCC CGT TCC ATT ACA GAT 98 

441 Leu Leu Leu Ser Thr Gin Ser Ser Ala Thr Ser Arg Ser lie Thr Asp 

442 15 20 25 
443 

444 CGC TTC ATT CAA TGT TTA CAC GAC CGG GCC GAC CCT TCA TTT CCG ATA 146 

445 Arg Phe lie Gin Cys Leu His Asp Arg Ala Asp Pro Ser Phe Pro lie 

446 30 35 40 
447 



PAGE- 4 RAW SEQUENCE LISTING DATE: 09/03/1999 

PATENT APPLICATION US/09/258,031 A TIME: 14:51:34 

INPUT SET: S33211.raw 



448 


ACC 


GGA 


GAG 


GTT 


TAC 


ACT 


CCC 


GGA 


AAC 


TCA 


TCT 


TTT 


CCT 


ACC 


GTC 


TTG 


194 


449 


Thr Gly Glu Val 


Tyr 


Thr 


Pro 


Gly 


Asn 


Ser 


Ser 


Phe 


Pro 


Thr 


Val 


Leu 




450 






45 










50 










55 










451 




































452 


CAA 


AAC 


TAC 


ATC 


CGA 


AAC 


CTT 


CGG 


TTC 


AAT 


GAA 


ACT 


ACC 


ACA 


CCA 


AAA 


242 


453 


Gin 


Asn 


Tyr 


He 


Arg 


Asn 


Leu 


Arg 


Phe 


Asn 


Glu 


Thr 


Thr 


Thr 


Pro 


Lys 




454 




60 










65 










70 












455 




































456 


CCC 


TTT 


TTA 


ATC 


ATC 


ACA 


GCC 


GAA 


CAT 


GTT 


TCC 


CAC 


ATT 


CAG 


GCA 


GCT 


290 


457 


Pro 


Phe 


Leu 


He 


He 


Thr 


Ala 


Glu 


His 


Val 


Ser 


His 


He 


Gin 


Ala 


Ala 




458 


75 










80 










85 










90 




459 
460 


GTG 


GTT 


TGT 


GGC 


AAA 


CAA 


AAC 


CGG 


TTG 


CTA 


CTG 


AAA 


ACC 


AGA 


AGC 


GGT 




461 
462 


338 
Val 


Val 


Cys 


Gly 


Lys 


Gin 


Asn 


Arg 


Leu 


Leu 


Leu 


Lys 


Thr 


Arg 


Ser 


Gly 




463 










95 










100 










105 






464 


GOT 


CAT 


GAT 


TAT 


GAA 


GGT 


CTT 


TCC 


TAC 


CTT 


ACA 


AAC 


ACA 


AAC 


CAA 


CCC 


386 


465 


Gly His Asp 


Tyr 


Glu 


Gly 


Leu 


Ser 


Tyr 


Leu 


Thr 


Asn 


Thr 


Asn 


Gin 


Pro 




466 








110 










115 










120 








467 




































468 


TTC 


TTC 


ATT 


GTG 


GAC 


ATG 


TTC 


AAT 


TTA 


AGG 


TCC 


ATA 


AAC 


GTA 


GAT 


ATC 


434 


469 


Phe 


Phe 


lie 


Val 


Asp 


Met 


Phe 


Asn 


Leu 


Arg 


Ser 


He 


Asn 


Val 


Asp 


He 




470 






125 










130 










135 










471 




































472 


GAA 


CAA 


GAA 


ACC 


GCA 


TGG 


GTC 


CAA 


GCC 


GGT 


GCG 


ACT 


CTT 


GGT 


GAA 


GTG 




473 
474 


482 
Glu 


Gin 


Glu 


Thr 


Ala 


Trp 


Val 


Gin 


Ala 


Gly 


Ala 


Thr 


Leu 


Gly 


Glu 


Val 


475 




140 










145 










150 










J 


476 




































477 


TAG 


TAT 


CGA 


ATA 


GCG 


GAG 


AAA 


AGT 


AAC 


AAG 


CAT 


GGT 


TTT 


CCG 


GCA 


GGG 




478 


530 


































479 


Tyr 


Tyr 


Arg 


He 


Ala 


Glu 


Lys 


Ser 


Asn 


Lys 


His 


Gly 


Phe 


Pro 


Ala 


Gly 




480 


155 










160 










165 










170 




481 




































482 


GTT 


TGT 


CCA 


ACG 


GTT 


GGC 


GTT 


GGT 


GGG 


CAT 


TTT 


AGT 


GGT 


GGT 


GGG 


TAT 


578 


483 


Val 


Cys 


Pro 


Thr 


Val 


Gly 


Val 


Gly 


Gly 


His 


Phe 


Ser 


Gly 


Gly 


Gly 


Tyr 




484 










175 










180 










185 






485 




































486 


GGT 


AAT 


TTG 


ATG 


AGA 


AAA 


TAT 


GGT 


TTG 


TCG 


GTT 


GAT 


AAT 


ATT 


GTT 


GAT 


626 


487 


Gly Asn 


Leu 


Met 


Arg 


Lys 


Tyr 


Gly 


Leu 


Ser 


Val 


Asp 


Asn 


He 


Val 


Asp 




488 








190 










195 










200 








489 




































490 


GCT 


CAA 


ATA 


ATA 


GAT 


GTG 


AAT 


GGC 


AAG 


CTT 


TTG 


GAT 


CGA 


AAG 


AGT 


ATG 


674 


491 


Ala 


Gin 


He 


He 


Asp 


Val 


Asn 


Gly 


Lys 


Leu 


Leu 


Asp 


Arg 


Lys 


Ser 


Met 




492 






205 










210 










215 










493 




































494 


GGT 


GAG 


GAT 


TTG 


TTT 


TGG 


GCG 


ATC 


ACC 


GGC 


GGT 


GGT 


GGT 


GTT 


AGT 


TTT 


722 


495 


Gly Glu Asp 


Leu 


Phe 


Trp 


Ala 


He 


Thr 


Gly 


Gly 


Gly 


Gly 


Val 


Ser 


Phe 




496 




220 










225 










230 












497 




































498 


GGT 


GTG 


GTT 


CTA 


GCC 


TAC 


AAA 


ATC 


AAA 


CTA 


GTT 


CGT 


GTT 


CCG 


GAG 


GTT 


770 


499 


Gly Val Val 


Leu 


Ala 


Tyr 


Lys 


He 


Lys 


Leu 


Val 


Arg 


Val 


Pro 


Glu 


Val 




500 


235 










240 










245 










250 





PAGE: 5 , RAW SEQUENCE LISTING DATE: 09/03/1999 

PATENT APPLICATION US/09/258,031A TIME: 14:51:35 



INPUT SET: S33211.raw 

501 

502 GTG ACC GTG TTT ACC ATT GAA AGA AGA GAG GAA CAA AAC CTC AGC ACC 

503 818 

504 Val Thr Val Phe Thr lie Glu Arg Arg Glu Glu Gin Asn Leu Ser Thr 

505 255 260 265 
506 

507 ATC GCG GAA CGA TGG GTA CAA GTT GCT GAT AAG CTA GAT AGA GAT CTT 866 

508 lie Ala Glu Arg Trp Val Gin Val Ala Asp Lys Leu Asp Arg Asp Leu 

509 270 275 280 
510 

511 TTC CTT CGA ATG ACC TTT AGT GTC ATA AAC GAT ACC AAC GGT GGA AAG 914 

512 Phe Leu Arg Met Thr Phe Ser Val lie Asn Asp Thr Asn Gly Gly Lys 

513 285 290 295 
514 

515 ACA GTC CGT GCT ATC TTT CCA ACG TTG TAC CTT GGA AAC TCG AGG AAT 962 

516 Thr Val Arg Ala lie Phe Pro Thr Leu Tyr Leu Gly Asn Ser Arg Asn 

517 300 305 310 
518 

519 CTT GTT ACA CTT TTG AAT AAA GAT TTC CCC GAG TTA GGG TTG CAA GAA 1010 
52 0 Leu Val Thr Leu Leu Asn Lys Asp Phe Pro Glu Leu Gly Leu Gin Glu 
521 315 320 325 330 

522 

52 3 TCG GAT TGT ACT GAA ATG AGT TGG GTT GAG TCT GTG CTT TAC TAC ACG 1058 

524 Ser Asp Cys Thr Glu Met Ser Trp Val Glu Ser Val Leu Tyr Tyr Thr 

525 335 340 345 
526 

527 GGC TTC CCC AGT GGT ACT CCA ACC ACG GCG CTC TTA AGC CGT ACT CCT 1106 

528 Gly Phe Pro Ser Gly Thr Pro Thr Thr Ala Leu Leu Ser Arg Thr Pro 

529 350 355 360 
530 

531 
532 

53 3 CAA AGA CTC AAC CCA TTC AAG ATC AAA TCC GAT TAT GTG CAA AAT CCT 1154 

534 Gin Arg Leu Asn Pro Phe Lys lie Lys Ser Asp Tyr Val Gin Asn Pro 

535 365 370 375 
536 

53 7 ATT TCT AAA CGA CAG TTC GAG TTC ATC TTC GAA AGG CTG AAA GAA CTT 12 02 

538 lie Ser Lys Arg Gin Phe Glu Phe lie Phe Glu Arg Leu Lys Glu Leu 

539 380 385 390 
540 

541 GAA AAC CAA ATG TTG GCT TTC AAC CCA TAT GGT GGT AGA ATG AGT GAA JU 

542 1250 

543 Glu Asn Gin Met Leu Ala Phe Asn Pro Tyr Gly Gly Arg Met Ser Glu 

544 395 400 405 410 
545 

546 ATA TCC GAA TTC GCA AAG CCT TTC CCA CAT AGA TCG GGT AAC ATA GCG 1298 

547 lie Ser Glu Phe Ala Lys Pro Phe Pro His Arg Ser Gly Asn lie Ala 

548 415 420 425 
549 

550 AAA ATT CAA TAC GAA GTA AAC TGG GAG GAT CTT AGC GAT GAA GCC GAA 

551 . 1346 

552 Lys lie Glh Tyr Glu Val Asn Trp Glu Asp Leu Ser Asp Glu Ala Glu 

553 430 435 440 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/258,031 A 



DATE: 09/03/1999 
TIME: 14:51:35 



INPUT SET: S3321Lraw 

554 

555 AAT CGT TAG TTG AAT TTC ACA AGG CTG ATG TAT GAT TAG ATG AGG GGA 1394 

556 Asn Arg Tyr Leu Asn Phe Thr Arg Leu Met Tyr Asp Tyr Met Thr Pro 

557 445 450 455 
558 

559 TTT GTG TCG AAA AAC OCT AGA AAA GCA TTT TTG AAC TAT AGG GAT TTG 1442 

560 Phe Val Ser Lys Asn Pro Arg Lys Ala Phe Leu Asn Tyr Arg Asp Leu 

561 460 465 470 
562 

563 GAT ATT GGT ATC AAC AGC CAT GGC AGG AAT GCT TAT ACT GAA GGA ATG 

564 1490 

565 Asp lie Gly lie Asn Ser His Gly Arg Asn Ala Tyr Thr Glu Gly Met 

566 475 480 485 490 
567 

568 GTT TAT GGG CAC AAG TAT TTC AAA GAG ACA AAT TAG AAG AGG CTA GTA 

569 1538 

570 Val Tyr Gly His Lys Tyr Phe Lys Glu Thr Asn Tyr Lys Arg Leu Val 

571 495 500 505 
572 

573 AGT GTG AAG ACT AAA GTT GAT GCT GAC AAC TTC TTT AGG AAT GAG GAA 

574 1586 

575 Ser Val Lys Thr Lys Val Asp Pro Asp Asn Phe Phe Arg Asn Glu Gin 

576 510 515 520 
577 

578 AGC ATC CCA ACT TTG TCA TCT T GAAGAACGTA CATATATAAA TAAATACCTT 

579 1638 

580 Ser lie Pro Thr Leu Ser Ser 

581 525 
582 

583 TGTGCATGGT ATTTTCAGGG TGTTAAAGTG ATATTCAGAT ATTTATGATA 

584 GAATTTTGAC 1698 
585 

586 TTGTATTTTA TACAATCAAA ATTGTATGGT TCTCCGAATT TCTCTTTTTA 

587 ATTGTGAAAA 1758 
588 

58 9 ATACATATTA GTATTGTCAA AAAAAA 1784 

590 

591 



743 


(2) INFORMATION FOR SEQ ID NO : 19: 


744 






745 


(i) 


SEQUENCE CHARACTERISTICS: 


746 




(A) LENGTH: 1589 base pairs 


747 




(B) TYPE: nucleic acid 


748 




(C) STRANDEDNESS : double 


749 




(D) TOPOLOGY: linear 


750 






751 


(ii) 


MOLECULE TYPE: cDNA to mRNA 


752 






753 


(iii) 


HYPOTHETICAL: NO 


754 






755 


(iii) 


ANTI- SENSE: NO 




PAGE: 7 RAW SEQUENCE LISTING DATE: 09/03/1999 

PATENT APPLICATION US/09/258,031A TIME: 14:51:35 

INPUT SET: S33211.raw 

757 (vi) ORIGINAL SOURCE; 

758 (A) ORGANISM: Helianthus annuus 

759 (B) STRAIN: Zebulon 
760 

761 (ix) FEATURE: 

762 " (A) NAME/KEY: CDS 

763 (B) LOCATION: 1..1590 
764 

765 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

766 

767 ATG GAG ACT TCC ATT CTT ACT CTC CTT CTT CTC TTG CTC TCA ACC CAA 48 

768 Met Glu Thr Ser lie Leu Thr Leu Leu Leu Leu Leu Leu Ser Thr Gin 

769 15 10 15 
770 

771 
772 

773 TCT TCT GCA ACT TCC CGT TCC ATT ACA GAT CGC TTC ATT CAA TGT TTA 96 

774 Ser Ser Ala Thr Ser Arg Ser lie Thr Asp Arg Phe lie Gin Cys Leu 

775 20 25 30 
776 

777 CAC GAC CGG GCC GAC CCT TCA TTT CCG ATA ACC GGA GAG GTT TAC ACT 144 

778 His Asp Arg Ala Asp Pro Ser Phe Pro lie Thr Gly Glu Val Tyr Thr 

779 35 40 45 
780 

781 CCC GGA AAC TCA TCT TTT CCT ACC GTC TTG CAA AAC TAC ATC CGA AAC 192 

782 Pro Gly Asn Ser Ser Phe Pro Thr Val Leu Gin Asn Tyr lie Arg Asn 

783 50 55 60 
784 

785 CTT CGG TTC AAT GAA ACT ACC ACA CCA AAA CCC TTT TTA ATC ATC ACA 24 0 

786 Leu Arg Phe Asn Glu Thr Thr Thr Pro Lys Pro Phe Leu lie lie Thr 

787 65 70 75 80 
788 

789 GCC GAA CAT GTT TCC CAC ATT CAG GCA GCT GTG GTT TGT GGC AAA CAA 288 
7 90 Ala Glu His Val Ser His lie Gin Ala Ala Val Val Cys Gly Lys Gin 
791 85 90 95 

792 

793 AAC CGG TTG CTA CTG AAA ACC AGA AGC GGT GGT CAT GAT TAT GAA GGT 336 

794 Asn Arg Leu Leu Leu Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly 

795 100 105 110 
796 

7 97 CTT TCC TAC CTT ACA AAC ACA AAC CAA CCC TTC TTC ATT GTG GAC ATG 384 
7 98 Leu Ser Tyr Leu Thr Asn Thr Asn Gin Pro Phe Phe lie Val Asp Met 
799 115 120 125 

800 

801 TTC AAT TTA AGG TCC ATA AAC GTA GAT ATC GAA CAA GAA ACC GCA TGG 432 

802 Phe Asn Leu Arg Ser lie Asn Val Asp lie Glu Gin Glu Thr Ala Trp 

803 130 135 140 
804 

805 GTC CAA GCC GGT GCG ACT CTT GGT GAA GTG TAC TAT CGA ATA GCG GAG 480 

806 Val Gin Ala Gly Ala Thr Leu Gly Glu Val Tyr Tyr Arg He Ala Glu 

807 145 150 - 155 160 
808 

809 AAA AGT AAC AAG CAT GGT TTT CCG GCA GGG GTT TGT CCA ACG GTT GGC 528 



I 
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810 
811 
812 
813 
814 
815 
816 
817 
818 
819 
820 
821 
822 
823 
824 
825 
826 
827 
828 
829 
830 
831 
832 
833 
834 
835 
836 
837 
838 
839 
840 
841 
842 
843 
844 
845 
846 
847 
848 
849 
850 
851 
852 
853 
854 
855 
856 
857 
858 
859 
860 
861 
862 



RAW SEQUENCE LISTING DATE: 09/03/1999 

PATENT APPLICATION US/09/258,031A TIME: 14:51:36 

INPUT SET: S33211,raw 
Lys Ser Asn Lys His Gly Phe Pro Ala Gly Val Cys Pro Thr Val Gly 
165 170 175 



GTT GGT GGG CAT TTT AGT GGT GGT GGG TAT GGT AAT TTG ATG AGA AAA 576 
Val Gly Gly His Phe Ser Gly Gly Gly Tyr Gly Asn Leu Met Arg Lys 
180 185 190 



TAT GGT TTG TCG GTT GAT AAT ATT GTT GAT GCT CAA ATA ATA GAT GTG 624 

Tyr Gly Leu Ser Val Asp Asn lie Val Asp Ala Gin lie lie Asp Val 
195 200 205 

AAT GGC AAG CTT TTG GAT CGA AAG AGT ATG GGT GAG GAT TTG TTT TGG 672 

Asn Gly Lys Leu Leu Asp Arg Lys Ser Met Gly Glu Asp Leu Phe Trp 
210 215 220 

GCG ATC ACC GGC GGT GGT GGT GTT AGT TTT GGT GTG GTT CTA GCC TAG 720 

Ala He Thr Gly Gly Gly Gly Val Ser Phe Gly Val Val Leu Ala Tyr 

225 230 235 240 



AAA ATC AAA CTA 
Lys He Lys Leu 



GTT CGT GTT 
Val Arg Val 
245 



CCG GAG GTT 
Pro Glu Val 
250 



GTG ACC 
Val Thr 



GTG TTT 
Val Phe 



GAA AGA 
816 

Glu Arg 



AGA GAG GAA CAA AAC CTC AGC ACC 
Glu Gin Asn 



Arg Glu 
260 



Leu Ser Thr 
265 



ATC GCG GAA CGA 

He Ala Glu Arg 
270 



ACC ATT 
Thr He 
255 

TGG GTA 
Trp Val 



768 



CAA GTT 

Gin Val 



GCT GAT 

Ala Asp 
275 



AAG CTA GAT 

Lys Leu Asp 



AGA GAT CTT 

Arg Asp Leu 
280 



TTC CTT 

Phe Leu 



CGA ATG 

Arg Met 
285 



ACC TTT 

Thr Phe 



864 



AGT GTC 
Ser Val 
290 



ATA AAC 
He Asn 



GAT ACC AAC 
Asp Thr Asn 
295 



GGT GGA AAG 
Gly Gly Lys 



ACA GTC 
Thr Val 
300 



CGT GCT 
Arg Ala 



ATC TTT 
He Phe 



912 



CCA ACG 
Pro Thr 
305 

AAA GAT 
1008 
Lys Asp 



TTG TAC 
Leu Tyr 



CTT GGA AAC 
Leu Gly Asn 
310 



TCG AGG AAT 
Ser Arg Asn 



TTC CCC GAG TTA GGG TTG CAA GAA 
Phe Pro 



Glu Leu Gly 
325 



Leu Gin Glu 
330 



CTT GTT 

Leu Val 
315 

TCG GAT 
Ser Asp 



ACA CTT 
Thr Leu 



TTG AAT 
Leu Asn 
320 



960 



TGT ACT GAA ATG 

Cys Thr Glu Met 
335 




AGT TGG GTT GAG 

Ser Trp Val Glu 
340 



TCT GTG CTT 

Ser Val Leu 



TAC TAC ACG 

Tyr Tyr Thr 
345 



GGC TTC 

Gly Phe 



CCC AGT 

Pro Ser 
350 



GGT ACT 

Gly Thr 



1056 



CCA ACC ACG GCG 
Pro Thr Thr Ala 



CTC TTA AGC 
Leu Leu Ser 



CGT ACT CCT 
Arg Thr Pro 



CAA AGA 
Gin Arg 



CTC AAC 
Leu Asn 



CCA TTC 
Pro Phe 



1104 



PAGE: 9 



- - > 



863 
864 
865 
866 
867 
868 
869 
870 
871 
872 
873 
874 
875 
876 
877 
878 
879 
880 
881 
882 
883 
884 
885 
886 
887 
888 
889 
890 
891 
892 
893 
894 
895 
896 
897 
898 
899 
900 
901 
902 
903 
904 
905 
906 
907 
908 
909 
910 
911 
912 
913 
914 
915 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/258,031A 



DATE: 09/03/1999 
TIME: 14:51:36 



INPUT SET: S33211.raw 



355 



360 



365 



AAG ATC 
Lys lie 
370 

GAG TTC 
1200 
Glu Phe 
385 

TTC AAC 
1248 
Phe Asn 



CCT TTC 
1296 
Pro Phe 



AAC TGG 

Asn Trp 



ACA AGG 
Thr Arg 
450 



AAA TCC GAT TAT GTG CAA AAT CCT ATT TCT AAA CGA CAG TTC 1152 
Lys Ser Asp Tyr Val Gin Asn Pro lie Ser Lys Arg Gin Phe 
375 380 



ATC TTC GAA AGG ATG AAA GAA CTT GAA AAC CAA ATG TTG GCG 

lie Phe Glu Arg Met Lys Glu Leu Glu Asn Gin Met Leu Ala 
390 395 400 

CCA TAT GGT GGT AGA ATG AGT GAA ATA TCC GAA TTC GCA AAG 

Pro Tyr Gly Gly Arg Met Ser Glu lie Ser Glu Phe Ala Lys 
405 410 415 

CCA CAT AGA TCG GGT AAC ATA GCG AAG ATT CAA TAC GAA GTA 

Pro His Arg Ser Gly Asn lie Ala Lys lie Gin Tyr Glu Val 
420 425 430 



GAG GAT CTT AGC GAT GAA GCC GAA AAT CGT TAC TTG AAT TTC 1344 

Glu Asp Leu Ser Asp Glu Ala Glu Asn Arg Tyr Leu Asn Phe 
435 440 445 

CTG ATG TAT GAT TAC ATG ACT CCA TTT GTG TCG AAA AAC CCT 13 92 

Leu Met Tyr Asp Tyr Met Thr Pro Phe Val Ser Lys Asn Pro 
455 460 



AGA GAA GCA TTT TTG AAC 
Arg Glu Ala Phe Leu Asn 
465 470 



CAT GGC AGG AAT GCT TAT 
1488 

His Gly Arg Asn 



TTC AAA GAG ACA 
1536 

Phe Lys Glu Thr 
500 

GAT CCT GAC AAC 

Asp Pro Asp Asn 
515 

TCT TG 
Ser 

530 



Ala Tyr 
485 

AAT TAC 
Asn Tyr 



TAT AGG 
Tyr Arg 



ACT GAA 
Thr Glu 



GAT TTG GAT ATT GGT ATC AAC AGC 1440 
Asp Leu Asp lie Gly lie Asn Ser 
475 480 



GGA ATG GTT TAT GGG CAC AAA TAT 
Val Tyr 



Gly Met 
490 



Gly His Lys Tyr 
495 



AAG AGG CTA GTA 
Lys Arg 



TTC TTT 

Phe Phe 



AGG AAT 

Arg Asn 
520 



Leu Val 
505 

GAG CAA 

Glu Gin 



AGT GTG AAG ACT AAA GTT 

Ser Val Lys Thr Lys Val 
510 

AGC ATC CCA ACT TTG TCA 1584 

Ser lie Pro Thr Leu Ser 
525 

1589 



PAGE: 10 RAW SEQUENCE LISTING DATE: 09/03/1999 

PATENT APPLICATION US/09/258,031A TIME: 14:51:36 



INPUT SET: S33211.raw 

916 





±Uo ± 


fO\ T'NT'Cr^'PMTiT'TOM T?n"P CT?0 TT\ "Km* 0 T • 
\^} ±JNr wrCl^in,! XwlN r UrC OCiS^ XL/ iNU . zx. 






T ri o 








T T 






— > 










1 n 1 


V 13 / 1 X ±rIL . ilLLv-XcX^ CL^XU. 






10 J O 


\ 7 O IKAJNL'iliL^JNiloo I CLOLLJjXc 






IOj / 


\L>} lUxrUijUurx : XXIicajT 






T n "3 Q 

lU J o 








T r» o Q 

10 J y 


^xxj iyiUXjiljL.Uijiii LxfiLl CUrii\ tO turCLNA 






T rt /I n 
104U 








1041 


/ -i -i -1 \ TJV'D/^'T'U'C'T'T/^AT • "NTP* 






1042 








104 J 


/ -! -i -i \ TVIVT'T'T O'CMO'C • MO 

^lli; AN 1 X-hb^Nhti t JMU 






1044 








1045 


(VI ) ORIGINAL SOURCE : 






1046 


(A) ORGANISM: Arabidopsis t:ha.lxa.na. 






1047 


(B) STRAIN : ecotype ColuiulDia 






1048 


( ix) FEATURE : 






104 9 


(A; NAJyiE/KEY: CDS 






1050 


(B; LOCATION; 2. .350 






1051 








1052 


VXX; SEQUENCE DESCRIPTION: bEy ID NO: 21: 






1053 








1054 


O TV O TV A TV OrnOO O TV O TV OiTirrnT«/-1 A O A O TV TV 'T'OOOT* AAOOT'OAAAO lyOOOAOOOOA 

GAG AAAC T CG GAGAC TTTCA C AC AA xGC C i AAC C i C AAAC i C C GAC C C C A 




— > 


1 n c c 

1055 


AACATCCCAT O U 






1056 








1057 


CTCCCCCGCT ATCTTCTTCT CCGGAAAJLGG CiCCiACiCt- IL-L-Lri Ai i At, 




— > 


1 n c o 
lOOO 


A A /^^/^ A A ^A Tl T on A 

AAGCCAACAT 12 U /\ 






1059 








1060 


CCGTAACCTC CGCTTCAACA CCACCTCAAC TCCGAAACCC TTCCTCATAA / *^ 




— > 


lOO J. 


rnr<^/^/^or»A A 1 on 
TCGCCGCAAC lc5 U 






A ^ o 
1062 








1053 


TV O A rpO A A nnOO O A fTiOrpOO A A O OOOOO A TTT" A O T'lT'O OOOO A A A OOOOAOAAOO 

ACATGAATCC CATGTGCAAG CCGCGAi lAC liGCGGGAAA Ct:»CCAV-AAL-v- 




— > 


1 n c A 


T T CAGATGAA 2 4 U 






1065 








1066 


TV TV TV ^ TV TV 7\ /~1 /~1 7\ TV ^rp TV ^O TV OrpT'O'TiO'A n^TV O O nTT^ A O A rn A OITI 

AATCAGAAGT GGAGGCCACG ACTACGATGG CTTGTCATAC GTTACATACT 




— > 


lOo / 


CTGGCAAACC J U 0 






1068 








106 9 


GTTCTTCGTC CTCGACATGT TTAACC 1 CCG 1 i CGG i GGAi G i CGACG i GG 


1 f\ 
o 50 




1070 








1071 








1072 


(2) INFORMATION FOR SEQ ID NO : 22: 






1073 








1074 


(i) SEQUENCE CHARACTERISTICS: 




- - > 


1075 


(A) LENGTH: 278 base pairs 






1076 


(B) TYPE: nucleic acid 






1077 


(C) STRANDEDNESS : double 






1078 


(D) TOPOLOGY: linear 






1079 








1080 


(ii) MOLECULE TYPE: CDNA to itiRNA 





PAGE: 11 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/258,031 A 



DATE: 09/03/1999 
TIME: 14:51:37 



INPUT SET: S33211.raw 



■ - > 



1081 
1082 
1083 
1084 
1085 
1086 
1087 
1088 
1089 
1090 
1091 
1092 
1093 
1094 
1095 
1096 
1097 
1098 
1099 
1100 
1101 
1102 
1103 
1104 
1105 
1106 
1107 
1108 
1109 
1110 



(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2. .278 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GGCATGGATC TCCGCCGGAG CGACTCTCGG AGAGGTTTAT TATCGGATTT 
GGGAGAAAAG 60 

CAGAGTCCAT GGATTCCCCG CCGGAGTTTG ACCGACGGTT GGTGTTGGTG 
GGCATTTAAG 120 

CGGCGGTGGT TACGGTAACA TGGTGAGGAA GTTTGGATTA TCTGTGGATT 
ACGTTGAGGA 180 

TGCCAAGATC GTCGATGTAA ACNGTCGGGT TTTAGATCGG AAAGCAATGG 
GTGAGGATCT 240 

GTTCTGGGCG ATTACCGGTG GAGGAGGAGG TAGCGTAC 




278 



1111 


(2) INFORMATION FOR SEQ ID NO : 23: 


1112 






1113 


(i) 


SEQUENCE CHARACTERISTICS: 


1114 




(A) LENGTH: 345 base pairs 


1115 




(B) TYPE: nucleic acid 


1116 




(C) STRANDEDNESS : double 


1117 




(D) TOPOLOGY: linear 


1118 






1119 


(ii) 


MOLECULE TYPE: CDNA 


1120 






1121 


(iii) 


HYPOTHETICAL: NO 


1122 






1123 


(iii) 


ANTI- SENSE: NO 


1124 






1125 


(vi) 


ORIGINAL SOURCE: 


1126 




(A) ORGANISM: Arabidopsis thai 


1127 




(B) STRAIN: ecotype Columbia 


1128 






1129 


(ix) 


FEATURE : 


1130 




(A) NAME/KEY: CDS 


1131 




(B) LOCATION: 2 . .345 


1132 







PAGE: 12 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/258,031 A 



DATE: 09/03/1999 
TIME: 14:51:37 



INPUT SET: S3321Lraw 



- -> 



1133 
1134 
1135 
1136 
1137 
1138 
1139 
1140 
1141 
1142 
1143 
1144 
1145 
1146 
1147 
1148 
1149 
1150 
1151 
1152 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

TGGACATATT AGCGGAGGAG GATTCGGTAC AATAATGAGG AAATACGGTT 
TAGCGTCTGA 60 

TAACGTTGTG GACGCACGTT TGATGGATGT AAATGGGAAA ACTCTTGACC 
GGAAAACGAT 120 

GGGAGAGGAT TTGTTTTGGG CGCTTAGAGG CGGTGGAGCT GCGAGTTTTG 
GCGTTGTCTT 180 

GTCGTGGAAG GTTAAGCTTG CTAGGGTTCC TGAAAAGGTA ACTTGTTTCA 
TAAGTCAACA 240 

TCCGATGGGA CCTAGCATGA ACAAGCTTGT TCATAGATGG CAATCCATAG 
GATCAAGANN 300 

GCTAGACGAA GATTTATTCA TCAGAGTCAA TATTGACAAC AGTCT 




345 



1153 


(2) INFORMATION FOR SEQ ID NO: 24: 


1154 






1155 


(i) 


SEQUENCE CHARACTERISTICS: 


1156 




(A) LENGTH: 695 base pairs 


1157 




(B) TYPE: nucleic acid 


1158 




(C) STRANDEDNESS : double 


1159 




(D) TOPOLOGY: linear 


1160 






1161 


(ii) 


MOLECULE TYPE: CDNA to mRNA 


1162 






1163 


(iii) 


HYPOTHETICAL: NO 


1164 






1165 


(iii) 


ANTI- SENSE: NO 


1166 






1167 


(vi) 


ORIGINAL SOURCE: 


1168 




(A) ORGANISM: Arabidopsis thaliana 


1169 




(B) STRAIN: ecotype Columbia 


1170 






1171 


(ix) 


FEATURE : 


1172 




(A) NAME/KEY: CDS 


1173 




(B) LOCATION: 1. .6 95 


1174 






1175 


(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 24: 


1176 






1177 


GTTCGTTAAA ACCTATCCTN NANGGGCNAA AGNATATCAA AGNTTGNTTA 


1178 


NGNAACCCAA 60 


1179 






1180 


NATTTCTGAA CTGGCCNCCT TCGGTGGTAT ATGNCNAAAN CCCTTGAATC 


1181 


TGCGNANCCN 120 


1182 






1183 


ATTCCGCATA GAAACGGAAC CCTCTTCAAG ATTCTCTATT TACNCGAACT 


1184 


GNCTAGANNG 180 




PAGE: 13 



- - > 



- - > 



- - > 



--> 



1185 
1186 
1187 
1188 
1189 
1190 
1191 
1192 
1193 
1194 
1195 
1196 
1197 
1198 
1199 
1200 
1201 
1202 
1203 
1204 
1205 
1206 
1207 
1208 
1209 
1210 
1211 
1212 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/258,031A 



DATE: 09/03/1999 
TIME: 14:51:37 



INPUT SET: S3321Lraw 



AATGACAAGA CATCGAGTAG NAAAATCAAC TGGATCAAAG AGATATACAA 
TTACATGGCG 240 

CCTTATGTCT CAAGCAATCC AAGACAAGCA TATGTGAACT ACAGAGATCT 
AGACTTCGGA 300 

CAGAACAAGA ACAACGCAAA GGTTAACTTC ATTGAAGCTA AAATCTGGGG 
ACCTAAGTAC 360 

TTCAAAGGCA ATTTTGACAG ATTGGTGAAG ATTAAAACCA AGGTTGATCC 
AGAGAACTTC 420 

TTCAGGCACG AGCAGAGTAT CCCACCTATG CCCTACTAGA AGCTAGGTTC 
ATGAAACCAA 480 

TAACATTATC AAAAATAAGR ATAAATGRTA ATTGTATACA ACATGATTCG 
KCTTTCTTTA 540 

TTTCAGACAA TGTGGACACT ACTCTAAANT AAAAWGTCNA TTTACCTTAA 
AAAAAAAATA 600 

ATCCCCNNTA ANANAAAANT GGGGGGGCCN TTTTTGGGGN TCCCGGTTTT 
NGGACGGGGN 660 

GCTTTNGGGG GGCTTGGNNT TTTTTTNGGN GCCCC 




695 



1213 


(2) INFORMATION FOR SEQ ID NO: 25: 


1214 






1215 


(i) 


SEQUENCE CHARACTERISTICS: 


1216 




(A) LENGTH: 495 base pairs 


1217 




(B) TYPE: nucleic acid 


1218 




(C) STRANDEDNESS : double 


1219 




(D) TOPOLOGY: linear 


1220 






1221 


(ii) 


MOLECULE TYPE: CDNA to mRNA 


1222 






1223 


(iii) 


HYPOTHETICAL: NO 


1224 






1225 


(iii) 


ANTI- SENSE: NO 


1226 






1227 


(vi) 


ORIGINAL SOURCE: 


1228 




(A) ORGANISM: Arabidopsis thai; 


1229 




(B) STRAIN: ecotype Columbia 


1230 






1231 


(ix)' 


FEATURE : 


1232 




(A) NAME/KEY: CDS 


1233 




(B) LOCATION: 2 . .495 


1234 






1235 


(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO 


1236 







PAGE: 14 



- -> 



- - > 



--> 



1237 
1238 
1239 
1240 
1241 
1242 
1243 
1244 
1245 
1246 
1247 
1248 
1249 
1250 
1251 
1252 
1253 
1254 
1255 
1256 
1257 
1258 
1259 
1260 
1261 
1262 
1263 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/258,031A 



DATE: 09/03/1999 
TIME: 14:51:38 



INPUT SET: S3321Lraw 



TCTGTTTTNA GGCAGAGCAG AGGAAGTTGT TGCTTTGCTT GGTAAGGAGT 
TTCCTGAATT 60 

NAGTTTAAAG AAGGAGAACT GTTCGGAGAT GACTTGGTTT CAGTCAGCTT 
TATGGTGGGA 12 0 

TAATCGTGTT AACCCTACTC ANATTGATCC WAAAGTGTTT CTCGATCGGA 
ATCTTGATAG 180 

AGCGT^TTTC GGAAAGAGGA AATCGGATTA CGTTGCGAGT AAGATTCCTA 
GAGATGGGAT 240 

TAAGYCTTTT TCCAAGARGA TGMCTGACCT GGGGAAAAYC GGGCTTGTTT 
TTAAWCCGTA 300 

TGGTGGGAAA ATGGCGGAGG TTACGGTTAA CGCGACGCCG TTTCCNCACC 
GAAGCAAGCT 360 

TTTTAAGATT CAGTACTCGG TGACTTNGCA AGAAAACTCT NTCGAGATAG 
AGAAAGGGTT 42 0 

TCTTGAATCA GGCTAACGTC CTTATAGGTT CATGACCGGG TTTTTNAGCA 
AGANCCCTGG 480 

AATNCTTACT TNAAT 




495 



1264 


(2) INFORMATION FOR SEQ ID NO : 26: 


1265 






1266 


(i) 


SEQUENCE CHARACTERISTICS: 


1267 




(A) LENGTH: 204 base pairs 


1268 




(B) TYPE: nucleic acid 


1269 




(C) STRANDEDNESS : double 


1270 




(D) TOPOLOGY: linear 


1271 






1272 


(ii) 


MOLECULE TYPE: cDNA to mRNA 


1273 






1274 


(iii) 


HYPOTHETICAL: NO 


1275 






1276 


(iii) 


ANTI- SENSE: NO 


1277 






1278 


(vi) 


ORIGINAL SOURCE: 


1279 




(A) ORGANISM: Arabidopsis thaliana 


1280 




(B) STRAIN: ecotype Columbia 


1281 






1282 


(ix) 


FEATURE : 


1283 




(A) NAME/KEY: CDS 


1284 




(B) LOCATION: 1. .2 04 


1285 






1286 


(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO : 26 


1287 






1288 


AAATTAAAAC AAATCAATGT TGATATTGAA TCCAATAGTG < 




PAGE: 15 RAW SEQUENCE LISTING DATE: 09/03/1999 

PATENT APPLICATION US/09/258,031A TIME: 14:51:38 

INPUT SET: S33211.raw 

--> 1289 ACCTGGTGCT 60 

1290 

1291 ACGCTTGGTG AGCTTTACTA CAGAATTNCA GAGAAGAGCA AAATCCATGG 
--> 1292 ATTTCCNGCG 12 0 
1293 

1294 GGTTTNTNCA CAAGCNTAGG CATAGGTGGG TATATNANAG GCGGTGGATA 
--> 1295 CGGTACCTTG 180 
1296 

1297 ATGAGGAAGT ATGGTCTTNC GGGA 204 
1298 
1299 

1300 (2) INFORMATION FOR SEQ ID NO: 27: 
1301 

1302 (i) SEQUENCE CHARACTERISTICS: 

--> 1303 (A) LENGTH: 491 base pairs 

1304 (B) TYPE: nucleic acid 

1305 (C) STRANDEDNESS : double 

1306 (D) TOPOLOGY: linear 
1307 

1308 (ii) MOLECULE TYPE: cDNA to mRNA 

1309 

1310 (iii) HYPOTHETICAL: NO 

1311 

1312 (iii) ANTI-SENSE: NO 

1313 

1314 (vi) ORIGINAL SOURCE: 

1315 (A) ORGANISM: Arabidopsis thaliana 

1316 (B) STRAIN: ecotype Columbia 
1317 

1318 (ix) FEATURE: 

1319 (A) NAME/KEY: CDS 

1320 (B) LOCATION: 2.. 491 
1321 

1322 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

1323 

1324 GAGATTTCTC GAGCAAGATA CTCCACTGAT GATCTTTGAG CCATTGGGTG 

1325 6GAAAATCAG 60 
1326 

1327 CAAGATTTCA GAAACAGAAT CTCCATATCC ACACAGAAGA GGTAATCTGT 
--> 1328 ATAATATACA 120 
1329 

13 30 GTACATGGTG AAATGGAAAG TGAATGANGT CGAGGAGATG AACAAACATG 
--> 1331 TCAGGTGGAT 180 
1332 

1333 GAGATCGTTA CACGATTACA TGACTCCGTA TGTTTCTAAA TCGCCGAGAG 

1334 GAGCTTATTT 240 

1335 

13 3 6 GANTTACAGA GATCTTGATT TGGGCTCGAC CAAAGGGATT AACACGGGTT 
--> 1337 TCGGAGATGC 300 
1338 

1339 AAGGAAATGG NNGGGTGAGN CTTTTTTCAA AGGTAATTTC CAAGGGGTTA 
--> 1340 GGTTTTGGTT 360 




PAGE: 16 



- - > 



- - > 



1341 
1342 
1343 
1344 
1345 
1346 
1347 
1348 
1349 
1350 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/258,031A 



DATE: 09/03/1999 
TIME: 14:51:39 



INPUT SET: S33211.raw 



AAAGGGGAGG TTTNNCCCAN CAAATTTTTT TTCAGGANCC GGCCANGNTT 
TTCCCCCCCC 420 

TNTTTTTNGG NCCCCAATCN AAANCCCCGT TTTAAAAGGG GGGCCATTTC 
NTTTTTTNCA 480 

NNTTAAAAGG G 




491 



- -> 



- - > 



1351 
1352 
1353 
1354 
1355 
1356 
1357 
1358 
1359 
1360 
1361 
1362 
1363 
1364 
1365 
1366 
1367 
1368 
1369 
1370 
1371 
1372 
1373 
1374 
1375 
1376 
1377 
1378 
1379 
1380 
1381 
1382 
1383 
1384 
1385 
1386 
1387 
1388 
1389 
1390 
1391 
1392 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 407 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3.. 407 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

ATTTGTTCGT GAGGTTAACT TTGACTTTAG TCAACGGTAC GAAGCCTGGT 
GAGAATAC6G 60 

TTTTAGCGAC TTTCATTGGG ATGTATTTAG GCCGGTCGGA TAAGCTGTTG 
ACCGTNATGA 120 

ACCGGGATTT CCCGGAGTTG AAGCTGAAGA AAACCGATTN TACCGAGATG 
AGATGGATCG 180 

ATTCGGTTCT GTTTTGGGAC GATTATCCGG TTGGTACACC GACTTCTGTG 
CTACTAAATC 240 

CGCTAGTCGC AAAAAAGTTG TTCATGAAAC GAAAATCGGA CTACGTGAAG 
CGTCTNATTT 300 

TCGAGAACCC GATCTCNNGT TTGATACTCA AGAAATTTGT AGAGGTTNNG 
AAAGTTAAAA 360 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/258,031 A 



DATE: 09/03/1999 
TIME: 14:51:39 



INPUT SET: S33211.raw 

13 93 TNAATTTGGA TCCGCATTNN GGNANNNATG GTGAAACCCC NNGTTNT 407 

1394 

1395 





13 96 


\2) INFORMATION FOK bbs^ ID NO: : 




13 97 






13 98 




- - > 


1 ^ ft o 

13 99 


(A) XjaNGTH : Jou oa.se paxrs 




1400 


(B) TYPE: nucleic acid 




140 1 


\C} STRANDriDiNhtfab : uOUDIe 




1402 


[u) topoIjOGY : linear 




1403 






1404 


^li; MOLiECUIjE lYFb: CDNA uO TtlKNA 




1405 






1406 


(ill) HYPOTHETICAL: NO 




1407 






1408 


(ill) ANTI-SENSE: NO 




1409 






1410 


(vi) ORIGINAL SOURCE: 




1411 


(A) ORGANISM: Araoiaopsis tnaliana 




1412 


(B) STRAIN: ecotype Columbia 




1413 






1414 


(ix) FEATURE: 




1415 


(A) NAME/KEY: CDS 




1416 


(B) LOCATION: 3.. 3 60 




1417 






1418 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 




1419 






1420 


ACGGCGTCGT ATTGGCCTAC AAAATAAACC TTGTTGAAGT CCCAGAAAAC 


- - > 


1421 


GTCACCGTTT 6 0 




1422 






1423 


m T\ TV n m^my^ ✓-s /-i 71 y^/^ mm TV TV TV TV TV TV TV m/^ f^r^ TV r^f^f^ TV m TV m TV mrri/^ tv f^f^f^f^ A ^ 

TCAGAATCTC CCGGACGTTA GAACAAAATG CGACGGATAT CATTCACCGG 


--> 


1424 


TGGCAACAAG 120 




1425 






1426 


TTGCACCGAA GCTTCCCGAC GAGCTTTTCA TAAGANCAGT CATTGACGTA / 


- ~> 


1427 


NAAACGGCAC 180 




1428 






1429 


TGTTTCATNN CTCAAAAGAC CGTCAGACAA CATTCATAGC AATGTTTCTA 


--> 


1430 


GGAGACACGN 240 




1431 






1432 


CAACTCTACT GTCGATATTA AACCGGAGAT TCCCAGAATT GGGTTTGGTC 


- - > 


1433 


CGGTCTGACT 300 




1434 






1435 


GTACCGNAAC AAGCNNTTGG ATCCAATCTG TGCTATTTTT GGGACAAATA 


- - > 


1436 


TCCCAGGTTG 360 




1437 






1438 






1439 


(2) INFORMATION FOR SEQ ID NO : 30: 




1440 






1441 


(i) SEQUENCE CHARACTERISTICS: 


- - > 


1442 


(A) LENGTH: 427 base pairs 




1443 


(B) TYPE: nucleic acid 



PAGE: 18 RAW SEQUENCE LISTING DATE: 09/03/1999 

PATENT APPLICATION US/09/258,031A TIME: 14:51:39 

INPUT SET: S33211.raw 

1444 (C) STRANDEDNESS : double 

1445 (D) TOPOLOGY: linear 
1446 

1447 (ii) MOLECULE TYPE: cDNA to mRNA 

1448 

1449 (iii) HYPOTHETICAL: NO 

1450 

1451 (iii) ANTI-SENSE: NO 

1452 

1453 (vi) ORIGINAL SOURCE: 

1454 (A) ORGANISM: Arabidopsis thaliana 

1455 (B) STRAIN: ecotype Columbia 
1456 

1457 (ix) FEATURE: 

1458 (A) NAME/KEY: CDS 

1459 (B) LOCATION: 3.. 427 
1460 

1461 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

1462 

1463 TCTTCACTGT CACCAAAACG TTAGAACAAG ACGCAAGATT GAAGACTATT 
--> 1464 TCTAAGTGGC 60 
1465 

1466 AACAAATTTC ATCCAAGATT ATTGAAGAGA TACACATCCG AGTGGTACTC 
--> 1467 AGAGCAGCTG 120 
1468 

1469 GAAATGATGG AAACAAGACT GTGACAATGA CCTACCTAGG TCAGTTTCTT 
--> 1470 GGCGAGAAAG 180 
1471 

1472 GCACCTTGCT GAAGGTTATG GAGAAGGCTT TTCCAGAACT AGGGTTAACT 
--> 1473 CAAAAGGATT 240 
1474 

1475 GTACTGAAAT GAGCTGGATT GAAGCCGCCC TTTTCCATGG TGGRTTTCCA 
--> 1476 ACAGGKTCTC 300 
1477 

1478 CTATTGAAAT TTTGCTTMAG CTCAAGTCGC CTYTAGGAAA AGRTTWCTTC 
--> 1479 AAAGCAACGK 360 
1480 

1481 CGGATTTCGT TAAAGAACCT WTTCCTGTGA TAGGGCTCAA AGGAATATTC 
--> 1482 AAAAGATTGA 420 
1483 

1484 TTGAAGG 427 
1485 
1486 




1487 (2) INFORMATION FOR SEQ ID NO : 31: 
1488 

1489 (i) SEQUENCE CHARACTERISTICS: 

1490 (A) LENGTH: 437 base pairs 

1491 (B) TYPE: nucleic acid 

1492 (C) STRANDEDNESS : double 

1493 (D) TOPOLOGY: linear 
1494 

1495 (ii) MOLECULE TYPE: cDNA to mRNA 



PAGE: 19 RAW SEQUENCE LISTING DATE: 09/03/1999 

PATENT APPLICATION US/09/258,031A TIME: 14:51:40 

INPUT SET: S33211.raw 

1496 

1497 (iii) HYPOTHETICAL: NO 

1498 

1499 (iii) ANTI-SENSE: NO 

1500 (vi) ORIGINAL SOURCE: 

1501 (A) ORGANISM: Arabidopsis thaliana 

1502 (B) STRAIN: ecotype Columbia 
1503 

1504 (ix) FEATURE: ^ 

1505 (A) NAME/KEY: CDS 

1506 (B) LOCATION: 1..437 
1507 

1508 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 31: 

1509 

1510 GTTGTACTAT CATNGAAGAT TAAGTTAGTC GATGTTCCGT CCACGGTCAC 
--> 1511 CGNGTTTAAA 60 

1512 

1513 GTCCAGAAAC ATNAGGAGAA AGAGGCCGTT AGGNTCATCA ACAAGTGGCA 
--> 1514 GTATGTTGCG 120 
1515 

1516 GATAAGGTCC CTGAAGATCT TTTCATCAGC GCAACGTTGG NGAGATCAAA 
--> 1517 CGGAAACTCT 180 
1518 

1519 GTGCAGGCTT TGTTTACTGG ACTCTATCTT GGNCCGGTGA ATAATNTCTT 
--> 1520 GGCCTTGATG 240 
1521 

1522 GAAGAAAAGT TTCCAGANTT AGGTCTTGAT ATCCAAGNCT GCACAGAGAT 
--> 1523 GAGTTGGGCT 300 
1524 

1525 GAATCTGCAC TCTGGTNTNC TGNTTTCNCT AAAGGAGAGN CTCCTTGGGT 
--> 1526 GTTCCNCGCG 360 
1527 

1528 GATCGGNAGC GGNCAATTTN TGGNCTTTCA AGGGGAAAGN CGGCTTTTTN 
--> 1529 CAAGAACCCG 420 
1530 

1531 NTACCCGGGG TTCAATT 43 7 

1532 
1533 




1534 


(2) INFORMATION FOR SEQ ID NO : 32: 


1535 






1536 


(i) 


SEQUENCE CHARACTERISTICS: 


1537 




(A) LENGTH: 441 base pairs 


1538 




(B) TYPE: nucleic acid 


1539 




(C) STRANDEDNESS : double 


1540 




(D) TOPOLOGY: linear 


1541 






1542 


(ii) 


MOLECULE TYPE: cDNA to inRNA 


1543 






1544 


(iii) 


HYPOTHETICAL: NO 


1545 






1546 


(iii) 


ANTI- SENSE: NO 


1547 







PAGE: 20 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/ 09/258,031 A 



DATE: 09/03/1999 
TIME: 14:51:40 



INPUT SET: S33211.raw 

1548 (vi) ORIGINAL SOURCE: 

1549 (A) ORGANISM: Arabidopsis thaliana 
1550 

1551 ( ix ) FEATURE : 

1552 (A) NAME/KEY: CDS 

1553 (B) LOCATION: 1..441 
1554 

1555 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

1556 

1557 GCGGACCCTA TAGATCANNA TGTGCTACTG ANAGAAGAGG AAGCCAAGAA 

1558 CAAGCCGGAG 60 
1559 

1560 ACAGATAAAT ATCTGAAATG GGNCGATANC GTTTACGAAT TTATGACNCC 

1561 ATATGTTTCG 120 
1562 

1563 AAATCTCCAA GAGGAGCTTA TGTCAATTTC AAGGATATGG ATTTGGGTAT 

1564 GTATCTTGGA 180 
1565 

156 6 AAGAAGAAGA CAAAGTACGA GGAAGGAAAG AGTTGGGGAG TGAAGTATTT 
1567 CAAGAACAAT 240 
1568 

1569 TTCGAGAGAT TGGTGAGAGT GAAGACTAGG GTTGATCCAA CAGATTTCTT 

1570 CTGCGATGAA 300 
1571 

1572 CAGAGCATTC CTCTGGTGAA CAAAGTTACC TGAAGATATC ATTTGAAGTT 

1573 TTTTATTAGT 360 
1574 

1575 CCCTTTTCTC TGTGAAATCA TCTGTGCGTG TTGAATATTA TGCGTCAAGT 

1576 GTGTAACTTA 420 
1577 

1578 TGTGTGTGAT TGTGAATTGT G 441 
1579 
1580 




1581 


(2) INFORMATION FOR SEQ ID NO: 33: 


1582 






1583 


(i) 


SEQUENCE CHARACTERISTICS: 


1584 




(A) LENGTH: 502 base pairs 


1585 




(B) TYPE: nucleic acid 


1586 




(C) STRANDEDNESS : double 


1587 




(D) TOPOLOGY: linear 


1588 






1589 


(ii) 


MOLECULE TYPE: cDNA to mRNA 


1590 






1591 


(iii) 


HYPOTHETICAL: NO 


1592 






1593 


(iii) 


ANTI- SENSE: NO 


1594 






1595 


(vi) 


ORIGINAL SOURCE: 


1596 




(A) ORGANISM: Arabidopsis thaliana 


1597 




(B) STRAIN: ecotype Columbia 


1598 






1599 


(ix) 


FEATURE : 



PAGE: 21 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/258, 031 A 



DATE: 09/03/1999 
TIME: 14:51:41 



INPUT SET: S33211.raw 



--> 



--> 



- - > 



1600 
1601 
1602 
1603 
1604 
1605 
1606 
1607 
1608 
1609 
1610 
1611 
1612 
1613 
1614 
1615 
1616 
1617 
1618 
1619 
1620 
1621 
1622 
1623 
1624 
1625 
1626 
1627 
1628 
1629 
1630 
1631 



(A) NAME/KEY: CDS 

(B) LOCATION: 2. .502 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

CTGGCTTAAC ACAACGTCGT TTTGGGCCAA TTACCCGGCG GGTACACCCA 
AGAGCATCCT 60 

TCTAGATAGG CCTCCGACGA ATTCAGTGTC ATTTAAGAGT AAATCGGATT 
TTGTCAAAAA 120 

ACCAATACCC AAAAAAGGTT TAGAGAAGCT TTGGAAGACA ATGTTTAAAT 
TCAACAGTAG 180 

CGTCTCGTTG CAATTCAACC CTTACGGTGG AGTGATGGAC CGGATTCCGG 
CAACGGCCAC 240 

CGCTTTTCCT CATCGGAAAG GAAACTTGTT CAAGGTTCAA TACNCTACGA 
TGTGGTTTGA 300 

CGCAAACGCC ACACAGAGTA GCCNGGCTAT GATGAATGAG CTTTTTGAGG 
TGGCGGGACC 360 

GTACGTGNGT CAAGTAAACC CGAGANANGG CTTCCTTTAA NTTCAGAGNC 
CATCGNTNTT 420 

NGGAGCAANN CCAAGTGGGG GGGNCCAACC GGGGGNTNAA ANCNNAGNTC 
TTNGGGGGCC 480 

CAGAATTTCC TTNGGGGAAT TT 




502 



1632 


(2) INFORMATION FOR SEQ ID NO : 34: 


1633 






1634 


(i) 


SEQUENCE CHARACTERISTICS: 


1635 




(A) LENGTH: 400 base pairs 


1636 




(B) TYPE: nucleic acid 


1637 




(C) STRANDEDNESS : double 


1638 




(D) TOPOLOGY: linear 


1639 






1640 


(ii) 


MOLECULE TYPE: cDNA to mRNA 


1641 






1642 


(iii) 


HYPOTHETICAL: NO 


1643 






1644 


(iii) 


ANTI- SENSE: NO 


1645 






1646 


(vi) 


ORIGINAL SOURCE: 


1647 




(A) ORGANISM: Arabidopsis thai; 


1648 




(B) STRAIN: ecotype Columbia 


1649 






1650 


(ix) 


FEATURE : 


1651 




(A) NAME/KEY: CDS 



PAGE: 22 RAW SEQUENCE LISTING DATE: 09/03/1999 

PATENT APPLICATION US/09/258,031A TIME: 14:51:41 

INPUT SET: S33211.raw 

1652 (B) LOCATION: 2.. 400 

1653 

1654 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

1655 

1656 NGGGAATTGC NCGAGGNAAG TTGTACCCAA TTCCTGGACC ACCATTGGTT 
--> 1657 TCCCAAGAAN 60 
1658 

1659 CCCGAGACAA CCGTTTTTCA ATNACCGTGA TGTTGATTTG GGTATTAATT 
--> 1660 CTCATAATGG 120 

1661^ 

1662 TAAAATCAGT AGTTATGTGG AAGGTAAACG TTACGGGAAG AAGTATTTCG 
--> 1663 CAGGTAATTT 180 
1664 

1665 CGAGAGATTG GTGAAGATTA AGACGAGAGT TGATAGTGGT AATTTCTTTA 
--> 1666 GGAACGAACA 240 
1667 

1668 GAGTATTCCT GTGTTACCAT AAGTGTATTT ATTTGATTAT TGGTTAGTGA 
--> 1669 AATTTGTTGT 300 
1670 

1671 TGTATAATGA TTATATGTCG TATTTTTATT TATTATTAGT AATTTATAAA 
--> 1672 GTTTGATATT 360 

1673 

1674 AAATACAAAT AGTATAATAA GATAGTTTCT TTTAGTAAAA 400 
1675 
1676 



- - > 




■ -> 



1677 


(2) INFORMATION FOR SEQ ID NO : 35: 


1678 






1679 


(i) 


SEQUENCE CHARACTERISTICS: 


1680 




(A) LENGTH: 383 base pairs 


1681 




(B) TYPE: nucleic acid 


1682 




(C) STRANDEDNESS : double 


1683 




(D) TOPOLOGY: linear 


1684 






1685 


(ii) 


MOLECULE TYPE: CDNA tO itlRNA 


1686 






1687 


(iii) 


HYPOTHETICAL: NO 


1688 






1689 


(iii) 


ANTI- SENSE: NO 


1690 






1691 


(vi) 


ORIGINAL SOURCE: 


1692 




(A) ORGANISM: Arabidopsis thaliana 


1693 




(B) STRAIN: ecotype Columbia 


1694 






1695 


(ix) 


FEATURE : 


1696 




(A) NAME/KEY: CDS 


1697 




(B) LOCATION: 2 . .3 83 


1698 






1699 


(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO : 3 5 


1700 






1701 


CAACTCTAAT GGGAACACCT ACTTCGATCG AATGTCGATG ( 


1702 


TTTTCTGGGC 60 


1703 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/258,031A 



DATE: 09/03/1999 
TIME: 14:51:41 



INPUT SET: S33211.raw 



- - > 



•-> 



•-> 



1704 
1705 
1706 
1707 
1708 
1709 
1710 
1711 
1712 
1713 
1714 
1715 
1716 
1717 
1718 
1719 
1720 
1721 



GGTTCGAGGA GGTGGAGCCG CGAGTTTCGG CATCGTGATG GGATACAAAA 
TCCGGTTGGT 120 

TCCGGTTCCG GAGAAAGTTA CGGTTTTTAG CGTCGGAAAA ACCGTCGGAG 
AAGGAGCCGT 180 

TGATCTTATA ATGAAGTGGC AGAACTTCTC TCATAGTACG GNTCGGAATT 
TNTTTGTGAA 240 

GCTGANTTTT GANTTTAGTC AACGGTGCAA AGCCGGGTGA AAAAAAGGTT 
TTAGNGNCTT 300 

TCANTTTGGN TGNAANCTTG GGGGTTTTAT NAGAACGGTT AACCGGGATT 
NANCCCGNGT 360 

TTTCCCGGGG TTAAAACCTT NGG 




383 



1722 


(2) INFORMATION FOR SEQ ID NO : 36: 




1723 








1724 


(i) 


SEQUENCE CHARACTERISTICS: 




1725 




(A) LENGTH: 354 base pairs 




1726 




(B) TYPE: nucleic acid 




1727 




(C) STRANDEDNESS: double 




1728 




(D) TOPOLOGY: linear 




1729 








1730 


(ii) 


MOLECULE TYPE: cDNA to lURNA 




1731 








1732 


(iii) 


HYPOTHETICAL: NO 




1733 








1734 


(iii) 


ANTI- SENSE: NO 




1735 








1736 


(vi) 


ORIGINAL SOURCE: 




1737 




(A) ORGANISM: Arabidopsis thaliana 


1738 




(B) STRAIN: ecotype Columbia 




1739 








1740 


(ix) 


FEATURE : 




1741 




(A) NAME/KEY: CDS 




1742 




(B) LOCATION: 1. .354 




1743 








1744 


(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO : 36: 


1745 








1746 


ATCAATGTTC TTACTAAACG TACACGAGCA TCGTTGGCTT 


TCAAGGCTAA 


1747 


ATCTGATTTT 60 




1748 








1749 


NTTCAAGAAC CGATNCCTAA AACCGGGATT TCGAAGCTTT 


GGAGACGGTT 


1750 


GCAAGAACCG 120 




1751 








1752 


GAAGCAGAGC ATGCTCAGCT AATTTNCACN CCATTTGGTG 


GTAAAATGAG 


1753 


TNAGATTGCA 180 




1754 








1755 


GATTACGAAA CACCATTTCC GCATAGGAAG GGGAATATAT 


ATNAGATTCA 



PAGE: 24 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/258,031A 



DATE: 09/03/1999 
TIME: 14:51:42 




INPUT SET: S33211.raw 

1756 GTACTTGAAT 240 

1757 

1758 TACTGGAGAG GAGACGTGAA AGAGAAGTAT ATTGAGATNG GTGGAGGAGA 

1759 GTTTACGGTT 300 
1760 

1761 GNTATNAGTA AGTTTTTTGG CGAAGTNTNC CNAGAGGNGN CTTNNTNTAA ACCT 

1762 354 
1763 
1764 

1765 (2) INFORMATION FOR SEQ ID NO: 37: 
1766 

1767 (i) SEQUENCE CHARACTERISTICS: 

1768 (A) LENGTH: 403 base pairs 

1769 (B) TYPE: nucleic acid 

1770 (C) STRANDEDNESS : double 

1771 (D) TOPOLOGY: linear 
1772 

1773 (ii) MOLECULE TYPE: CDNA to mRNA 

1774 

1775 (iii) HYPOTHETICAL: NO 

1776 

1777 (iii) ANTI-SENSE: NO 

1778 

1779 (vi) ORIGINAL SOURCE: 

1780 (A) ORGANISM: Arabidposis thaliana 

1781 (B) STRAIN: Bcotype Columbia 
1782 

1783 (ix) FEATURE: 

1784 (A) NAME/KEY: CDS 

1785 (B) LOCATION: 2.. 403 
1786 

1787 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

1788 

1789 TTTTTTAGTA CACTAATAAT CAAATGGAAT GAGAAATGAA GCCACAAAAG 

1790 TATCTGCAAT 60 
1791 

1792 CAAAATATCC TGCTATCTCC ATCTCAAGCT CTCAATAGTA TCCTCTCCGA 

1793 AAGTGAAATC 12 0 
1794 

1795 AACATTTCAA ACTCTATTTC TTGGTGGAAT CGATAGACTG ATTCCTCTGA 

1796 TGAACCAGAA 180 
1797 

1798 GTTTCCGGAA CTCGGCTTAC GATCTCAAGA CTGTTCGGAA ATGAGCTGGA 

1799 TCGAATCGAT 240 
1800 

1801 AATGTTCTTC AACTGGAGAT CAGGACAGCC GTTAGAGATT TTGCTCAACA 

1802 GAGACCTAAG 300 
1803 

1804 GATTCGAGGA TCAGTATTTC AAAGCAAAGT CAGGATTATG GTTCAAAAAC 

1805 CCGTTCCTGA 360 
1806 

1807 AAACGTTTTT CGAAGAGGTA TCCAAGGGGT TTCTCGAGCA AGT 403 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/258,031 A 



DATE: 09/03/1999 
TIME: 14:51:42 







INPUT SET: S33211.raw 




1808 






1809 






1810 


(2) INFORMATION FOR SEQ ID NO : 38: 




1811 






1812 


(i) SEQUENCE CHARACTERISTICS: 


- -> 


1813 


(A) LENGTH: 260 base pairs 




1814 


(B) TYPE: nucleic acid 




1815 


(C) STRANDEDNESS : double 




1816 


(D) TOPOLOGY: linear 




1817 






1818 


(ii) MOLECULE TYPE: cDNA to mRNA 




1819 






1820 


(iii) HYPOTHETICAL: NO 




1821 






1822 


(iii) ANTI-SENSE: NO 




1823 






1824 


(vi) ORIGINAL SOURCE: 




1825 


(A) ORGANISM: Arabidopsis thaliana 




1826 


(B) STRAIN: ecotype Columbia 




1827 


(ix) FEATURE: 




1828 


(A) NAME/KEY: CDS 




1829 


(B) LOCATION: 1. .260 




1830 






1831 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 




1832 






1833 


GAGATGAGTT GGATTAANTC TGTACTCTGG TTTGCTGATT TCCCTAAAGG 


- - > 


1834 


AGAATCTCTT 60 Ijlr^ 




1835 


Ir 




1836 


NGTGTTCTCA CGAATCGTAA GCGTACATCT CTATCTTTNA AAGGCAAAGA 


- - > 


1837 


TGATTTTATC 120 




1838 






1839 


CAAGAACCGA TACCCGAGGC TGCAATTNAA GAGATATGGA GGCGATTAGA 


- - > 


1840 


AGCCCCCNAG 180 




1841 






1842 


GCTCGGCTTG GAAAGATCAT ATTAACTCCA TTTGGTGGGA AAATNAGTGA 


- - > 


1843 


AATGGCAGAG 24 0 




1844 






1845 


TACGTANCAC CATTCCCACA 260 




1846 






1847 






1848 


(2) INFORMATION FOR SEQ ID NO : 39: 




1849 






1850 


(i) SEQUENCE CHARACTERISTICS: 


- - > 


1851 


(A) LENGTH: 605 base pairs 




1852 


(B) TYPE: nucleic acid 




1853 


(C) STRANDEDNESS: double 




1854 


(D) TOPOLOGY: linear 




1855 






1856 


(ii) MOLECULE TYPE: cDNA to mRNA 




1857 






1858 


(iii) HYPOTHETICAL: NO 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/258,031 A 



DATE: 09/03/1999 
TIME: 14:51:42 



- - > 



- -> 



- -> 



-> 



- - > 



- -> 



- - > 



1859 
1860 
1861 
1862 
1863 
1864 
1865 
1866 
1867 
1868 
1869 
1870 
1871 
1872 
1873 
1874 
1875 
1876 
1877 
1878 
1879 
1880 
1881 
1882 
1883 
1884 
1885 
1886 
1887 
1888 
1889 
1890 
1891 
1892 
1893 
1894 
1895 
1896 
1897 
1898 
1899 
1900 
1901 
1902 
1903 
1904 



INPUT SET: S3321Lraw 



(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2. .605 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

CTCTTGCATA TTCGCTGCAA GGATGGGAAA TTCAAAACCA CTCCCTACAA 
TTTTTTGTAT 60 

TATAGTTTCA GTCTTGTATT TTTAATTCTA TTGCATAACA CCAACTTCTT 
CATCAGCCTC 120 

CATCCAAGAT CAATTCATAA ACTGTGTCAA AAGAAACACA CATGTTTCTT 
TTCCACTCGA 180 

GAAAACGTTA TTCACCCCTG CGAAAAACGT CTCTTTGTTC AACCAAGTCC 
TTGANTCGAC 24 0 

GGCTCAAAAT CTCCAGTTCT TGGCAAAATC CATGCCTAAA CCGGGRTTCA 
TATTCAGACC 300 

GATTCACCAG TCTCAAGTCC AAGSTTCCAT CATTTGTTCA AMGRAACTCG 
GGNTTCATTT 360 

TNGTGTTTGA NGTGGCGGTC ACGATTTTCG AGGCCTTTGT NTTTATGTTT 
CACGGTTTGA 42 0 

AAAAACCGTT TATATTACTC GGCCTGTCAA ANTTGNANNC AAAATCANAT 
GTTGGATATT 480 

GNATTCCAAA TAGGTNCTTG GGGTNAACCT GGTGGCTANC GTTTGGTGAG 
CTTTTACTTT 540 

CAAGAATTTG CANGNGGANG TGCAAAGATT CCATGGGATT TCCCGGGGGG 
TTTNTTGCAC 600 

AATGT 




605 



1905 (2) INFORMATION FOR SEQ ID NO: 40: 
1906 

1907 (i) SEQUENCE CHARACTERISTICS: 

1908 (A) LENGTH; 464 base pairs 

1909 (B) TYPE: nucleic acid 

1910 (C) STRANDEDNESS : double 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/258,031 A 



DATE: 09/03/1999 
TIME: 14:51:43 



1911 
1912 
1913 
1914 
1915 
1916 
1917 
1918 
1919 
1920 
1921 
1922 
1923 
1924 
1925 
1926 
1927 
1928 
1929 
1930 
1931 
1932 
1933 
1934 
1935 
1936 
1937 
1938 
1939 
1940 
1941 
1942 
1943 
1944 
1945 
1946 
1947 
1948 
1949 
1950 
1951 
1952 



INPUT SET: S3321Lraw 



( D ) TOPOLOGY : 1 inear 
(ii) MOLECULE TYPE: CDNA to ttlRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2 . .464 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 40: 

AACACAAAAC TCTTCCATTT GGCTTCTCTC TTGCATATTC GTTGCAAGGA 
TGGGAAATTC 60 

AAAACCACTC CCTACAATTN CTTGTATTAT CGTTTCAGTC TTGTATTTTN 
NATTCTATTG 120 

CATAACACCA ACTTCTTCAT CAGCCTCCAT CCAAGNTCAA TTCATAAACT 
GTGTCAAAAG 180 

GAACACACAT GTTTCTTTTC CACTCGAGNA AACGGTATTC ACTCCTGCGG 
AAAACGGCTC 240 

TNTTATTCAA CGGGTCCNTG AATCGACGGG TCAAAATCTC CAGTTCTTGG 
NAAAATCCAT 300 

GNCTAAACCG GGGTTCATAT TCAGGCCGGT TCACCAGTCT CAAGTCCAAG 
NTTCCATCAT 360 

TTGTTCAAAG GAACTCGGGA TTCATTTCCG CGNTAGAAGT GGCGGGCANN 
GGTTTCGGGG 420 

CCTGTCTNTT GNTTANGGGN AGGAAAACCG GTTNTATTNC TCGG 




464 



1953 (2) INFORMATION FOR SEQ ID NO : 41: 
1954 

1955 (i) SEQUENCE CHARACTERISTICS: 

1956 (A) LENGTH; 386 base pairs 

1957 (B) TYPE: nucleic acid 

1958 (C) STRANDEDNESS : double 

1959 (D) TOPOLOGY: linear 
1960 

1961 (ii) MOLECULE TYPE: cDNA tO mRNA 
1962 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/258,031 A 



DATE: 09/03/1999 
TIME: 14:51:43 



INPUT SET: S33211.raw 



- - > 



- - > 



1963 
1964 
1965 
1966 
1967 
1968 
1969 
1970 
1971 
1972 
1973 
1974 
1975 
1976 
1977 
1978 
1979 
1980 
1981 
1982 
1983 
1984 
1985 
1986 
1987 
1988 
1989 
1990 
1991 
1992 
1993 
1994 
1995 
1996 
1997 



(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .3 86 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 41: 

TCGGGAGCCC ANGNTAAATT ANNTGAAAAT GGGGNCGNAT ANCCGTTTAC 

NGAATTTTAT 60 

GACNCCCAAT ATGTTTCGAA ATCTCAAAGA NNGGGANCTT ATGTCAATTT 

CAAGGATATG 120 

GATTTGGGTA TGTATCTTGG AAAGNAGAAG ACAAAGTACG AGGAAGGAAA 

GAGTTGG6GA 180 

GTGAAGTATT TCAAGAACAA TTTCGAGAGA TTGGTGAGAG TGAAGACTAG 

GGTTGATCCN 240 

ACAGATTTCN TCTGCGATGA ACAGAGCATT CCTCTGGTGN ACAAAGTTAC 

CTGAAGATAT 300 

CATTTGAAGT TTTTTATTAG TCCCTTTTCT CTGTGAAATC ATCTGTGCGT 

GTTGAATANT 360 

ATGCGTCAAG TGTGTAACTT ATGTGT 




386 



1998 (2) INFORMATION FOR SEQ ID NO: 42: 
1999 

20 00 (i) SEQUENCE CHARACTERISTICS: 

--> 2001 (A) LENGTH: 377 base pairs 

2002 (B) TYPE: nucleic acid 

2003 (C) STRANDEDNESS : double 

2004 (D) TOPOLOGY: linear 
2005 

2 006 (ii) MOLECULE TYPE: cDNA to mRNA 
2007 

2008 (iii) HYPOTHETICAL: NO 
2009 

2010 (iii) ANTI-SENSE: NO 
2011 

.2012 (vi) ORIGINAL SOURCE: 

2013 (A) ORGANISM: Arabidopsis thaliana 

2014 (B) STRAIN: ecotype Columbia 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/258, 031 A 



DATE: 09/03/1999 
TIME: 14:51:44 



INPUT SET: S33211.raw 

2015 

2 016 (ix) FEATURE: 

2 017 (A) NAME/KEY: CDS 

2018 (B) LOCATION: 1..377 

2019 

2020 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

2021 

2 022 TACCATAGGG AGGTGGTGNA AGATTTTGTA TGTAGNCTTA GGGGAAGGCG 
2 023 AGTAGTATGG 60 

2024 TGGTGGTGGG GAGCTGTAAA CGTATGGTGG TGGTGGAGAT TTGTATGTGG 
--> 2025 GCTGGTTAAC 120 
2026 

2027 TTCATTGAAG CTAAAATCTG GGGACCTAAG TACTTCAAAG GCAATTTTGA 
--> 2028 CAGATTGGTG 180 
2029 

2 03 0 AAGATTAAAA CCAAGGTTGA TCCAGAGAAC TTCTTCAGGC ACGAGCAGAG 
--> 2 031 TATCCCACCT 240 

2032 

203 3 ATGCCCTACT AGAAGCTAGG TTCATGAAAC CAATAACATT ATCAAAAATA 
--> 2034 AGAATAAATG 300 

2035 

2 036 ATAATTGTAT ACAACATGAT TCGTCTTTCT TTATTTCAGA CAATGTGGAC 
--> 2 037 ACTACTCTAA 360 

2038 

203 9 ATAAAATGTC ATTTACC 3 77 

2040 
2041 




2 042 (2) INFORMATION FOR SEQ ID NO : 43: 
2 043 

2044 (i) SEQUENCE CHARACTERISTICS: 

--> 2045 (A) LENGTH: 377 base pairs 

2046 (B) TYPE: nucleic acid 

2047 (C) STRANDEDNESS: double 

2048 (D) TOPOLOGY: linear 
2049 

2050 (ii) MOLECULE TYPE: cDNA to mRNA 
2051 

2052 (iii) HYPOTHETICAL: NO 
2053 

2054 (iii) ANTI-SENSE: NO 
2055 

2 056 (vi) ORIGINAL SOURCE: 

2057 (A) ORGANISM: Arabidopsis thaliana . 

2058 (B) STRAIN: ecotype. Columbia 
2059 

2060 (ix) FEATURE: 

2061 (A) NAME/KEY: CDS 

2062 (B) LOCATION: 1..377 
2063 

2064 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
2065 

2066 TACCATAGGG AGGTGGTGNA AGATTTTGTA TGTAGNCTTA GGGGAAGGCG 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/258, 031 A 



DATE: 09/03/1999 
TIME: 14:51:44 



INPUT SET: S3321Lraw 

--> 2067 AGTAGTATGG 60 

2068 

2069 TGGTGGTGGG GAGCTGTAAA CGTATGGTGG TGGTGGAGAT TTGTATGTGG 
--> 2070 GCTGGTTAAC 120 
2071 

2 072 TTCATTGAAG CTAAAATCTG GGGACCTAAG TACTTCAAAG GCAATTTTGA 
--> 2073 CAGATTGGTG 180 
2074 

2 075 AAGATTAAAA CCAAGGTTGA TCCAGAGAAC TTCTTCAGGC ACGAGCAGAG 
--> 2076 TATCCCACCT 240 
2077 

2078 ATGCCCTACT AGAAGCTAGG TTCATGAAAC CAATAACATT ATCAAAAATA 
--> 2079 AGAATAAATG 300 
2080 

2081 ATAATTGTAT ACAACATGAT TCGTCTTTCT TTATTTCAGA CAATGTGGAC 
--> 2082 ACTACTCTAA 360 
2083 

2 084 ATAAAATGTC ATTTACC 377 
2085 
2086 




2087 (2) INFORMATION FOR SEQ ID NO: 44: 
2088 

2089 (i) SEQUENCE CHARACTERISTICS: 

--> 2090 (A) LENGTH: 346 base pairs 

2091 (B) TYPE: nucleic acid 

2092 (C) STRANDEDNESS : double 

2093 (D) TOPOLOGY: linear 
2094 

2095 (ii) MOLECULE TYPE: CDNA tO mRNA 
2096 

2097 (iii) HYPOTHETICAL: NO 
2098 

2099 (iii) ANTI-SENSE: NO 
2100 

2101 (vi) ORIGINAL SOURCE: 

2102 (A) ORGANISM: Arabidopsis thaliana 

2103 (B) STRAIN: ecotype Columbia 
2104 

2105 (ix) FEATURE: 

2106 (A) NAME/KEY: CDS 

2107 (B) LOCATION: 2.. 346 
2108 

2109 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
2110 

2111 GAGCTGTGGA TATGGTCACA AATGGCAATC GGTTGGTCCG AAAACTGATC 

--> 2112 CGAATCTTTT 60 
2113 

2114 TATGAGAATN TTGATTCAAC CAGTGACGAG GAAGAAGGTA AAGACTGTGA 

--> 2115 GAGCTTCTNT 120 
2116 

2117 GGTTGCCCTN TTTTNAGGCN AGACAGATGA AGTTTTTGCT TTCCTTAGTA 

--> 2118 AGGAGTTTCC 180 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/ 09/258,031 A 



DATE: 09/03/1999 
TIME: 14:51:44 



INPUT SET: S33211.raw 

2119 

2120 TGAATTGGGT TTAAAGAAGG AGAATTNTTC GGAGATGACT TGGTTTCANT 

2121 CTGCTTTATG 240 
2122 

2123 GTGGGACAAT CGTCTTAATG CTACTCAGGT TGATCCTAAA GTNTTTCTTG 

2124 ATCGGAATCT 300 
2125 

2126 CGATACCTCG AGTTTCGGTA AGAGGAAATC GGATTACGTC GCGACT 346 
2127 
2128 




- - > 



- - > 



- - > 



2129 


(2) INFORMATION FOR SEQ ID NO: 45: 




2130 








2131 


(i) 


SEQUENCE CHARACTERISTICS: 




2132 




(A) LENGTH: 261 base pairs 




2133 




(B) TYPE: nucleic acid 




2134 




(C) STRANDEDNESS : double 




2135 




(D) TOPOLOGY: linear 




2136 








2137 


(ii) 


MOLECULE TYPE: CDNA to mRNA 




2138 








2139 


(iii) 


HYPOTHETICAL: NO 




2140 








2141 


(iii) 


ANTI- SENSE: NO 




2142 








2143 


(vi) 


ORIGINAL SOURCE: 




2144 




(A) ORGANISM: Arabidopsis thaliana 


2145 




(B) STRAIN: ecotype Columbia 




2146 








2147 


(ix) 


FEATURE : 




2148 




(A) NAME/KEY: CDS 




2149 




(B) LOCATION: 2 . .261 




2150 








2151 


(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 45: 


2152 








2153 


ATGGGGTGAG ACTTATTTCA AAGGTAATTT CAAGAGATTA 


GGTTTGGTTA 


2154 


AAGGGAAGNT 60 




2155 








2156 


TGATCCAACA AATTTCTTCA GGAACGAACA GAGTATTCCT 


CCTCTGTTTT 


2157 


GAGTCCTCAA 120 




2158 








2159 


TACAAAACCA GATATAAAAG ATGTCATTTC ATTTTTTCAA 


TTATAATAGA 


2160 


TAATGTAACT 180 




2161 








2162 


TTCTGCTACA ATTGTAAAAG TGAGATGTAC CCAATACGGT 


TTAAGCGGAC 


2163 


CGAGAATAGT 240 




2164 








2165 


CAATTCAAAG ACCAAATTCT G 




2166 








2167 










261 



2168 (2) INFORMATION FOR SEQ ID NO: 46: 
2169 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/258,031A 



DATE: 09/03/1999 
TIME: 14:51:45 



INPUT SET: S3321Lraw 

2170 (i) SEQUENCE CHARACTERISTICS: 

--> 2171 (A) LENGTH: 478 base pairs 

2172 (B) TYPE: nucleic acid 

2173 (C) STRANDEDNESS : double 

2174 (D) TOPOLOGY: linear 
2175 

2176 (ii) MOLECULE TYPE: CDNA to mRNA 

2177 

2178 (iii) HYPOTHETICAL: NO 

2179 

2180 (iii) ANTI-SENSE: NO 

2181 

2182 (vi) ORIGINAL SOURCE: 

2183 (A) ORGANISM: Arabidopsis thaliana 

2184 (B) STRAIN: ecotype Columbia 
2185 

2186 (ix) FEATURE: 

2187 (A) NAME/KEY: CDS 

2188 (B) LOCATION: 1..478 
2189 

2190 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

2191 

2192 GCTCAAAGGA CTAACCATGA AAACTTCCTC AAGTGTCTCT CTCACCGANT 

--> 2193 CAACGAGGAC 60 
2194 

2195 GACTCAAGAN TTATACACAC ATCAAAAGAT CCTTCGTATT TNTCAATCTT 

--> 2196 GATTTCTTCC 120 
2197 

2198 ATACAAAATC CAAGTTTCTC TGTTCTCGAA ACACCTAAAC CGGTTTCAAT 

--> 2199 CATCACTCCG 180 
2200 

2201 GTTCAAGCCA CCGATGTTCA ATCTACGNTT AAATNCGCAC GGNCTTCACG 

--> 2202 GGTATACACA 240 
2203 

22 04 ATCAGGGCTA GGAGTGGTNG TCATGACTAC GGAGGTTTAT CTTTACATTG 

--> 2205 GCTTAAAAAN 300 
2206 

2207 CANNCCGTTC GTTNNTCATT GATTTNNAGA AATCTTCCGG GCTTATTTAA 

- -> 2208 CATNTAAGAT 360 
2209 

2210 GTTTGATAAN CCGGNNCCNG TTTGGGGTTC AAATCCCGGT GGCTTACAAA 

--> 2211 NTTNGGGGGA 420 

2212 

2213 ATTGTNCCTA TGAGGTTTGG AAAATTAANG CAAAATNTTT TGGGCCTTCC 

--> 2214 CGGCCGGT 478 
2215 
2216 




2217 (2) INFORMATION FOR SEQ ID NO: 47: 
2218 

2219 (i) SEQUENCE CHARACTERISTICS: 

--> 2220 (A) LENGTH: 579 base pairs 

2221 (B) TYPE: nucleic acid 



PAGE: 33 RAW SEQUENCE LISTING DATE: 09/03/1999 

PATENT APPLICATION US/09/258,031A TIME: 14:51:45 

INPUT SET: S33211.raw 

2222 (C) STRANDEDNESS : double 

2223 (D) TOPOLOGY: linear 
2224 

2225 (ii) MOLECULE TYPE: CDNA to mRNA 

2226 

2227 (iii) HYPOTHETICAL: NO 

2228 

2229 (iii) ANTI-SENSE: NO 

2230 

2231 (vi) ORIGINAL SOURCE: 

2232 (A) ORGANISM: Arabidopsis thaliana 

2233 (B) STRAIN: ecotype Columbia 
2234 

2235 (ix) FEATURE: 

2236 (A) NAME/KEY: CDS 

2237 (B) LOCATION: 2.. 579 
2238 

2239 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

2240 

2241 GGCCGTTAGG ATCATCAAGA AATGGCAATA TGCTGCAGAT AAGGTTCCTG 
--> 2242 ATGATCTTTT 60 
2243 

2244 CATTAGGACA ACATTGGAGA GATCAAACAA GAACGCAGTA CACGCTTTGT 
--> 2245 TCACTGGACT 120 
2246 

2247 ATATATTGGT CCGGTGAACA ATCTATTGGC GTTGATGGAA GAAAAGTTTC 
--> 2248 CGGAACTAGG 180 
2249 

2250 TCTTGAGAAA GAAGGTTGTG AAGAGATGAG TTGGATTGAG TCTGTACTCT 
--> 2251 GGTTTGCTGA 240 
2252 

2253 TTTCCCTAAA GGAGAATCTC TTGGTGTTCT CACGAATCGT GAGCGTACAT 
--> 2254 CTCTATCTTT 300 
2255 

2256 CAAAGGCAAA GATGATTTTG TCCAAGAACC GATACCCGAG GCTGCAATTC 
--> 2257 AAGAGATATG 360 
2258 

2259 GAGGCGATTA GAAGCCCCCG AGGCTCGGCT TGGAAAGATC ATATTAACTC 
--> 2260 CATTTGGGTG 420 
2261 

2262 NGGNAAAATG AGTGAAATGG CAGAGNCCGA ACCACCAATT CCCACANNCG 

2263 AGGGAGGGGA 480 
2264 

2265 ACCCCTNTGN GGNTCAGAAT GTGGTTCCTG GNNNNNAAGN GGGNGCCAGN 
--> 2266 ACCAANCCGG 540 
2267 

2268 GNCNGTAAAN CNTGNAATGG GCCNAACCCG TNCCGGATT 579 
2269 
2270 

2271 (2) INFORMATION FOR SEQ ID NO : 48: 
2272 

2273 (i) SEQUENCE CHARACTERISTICS: 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/258,031 A 



DATE: 09/03/1999 
TIME: 14:51:45 



INPUT SET: S33211.raw 

--> 2274 (A) LENGTH: 252 base pairs 

2275 (B) TYPE: nucleic acid 

2276 (C) STRANDEDNESS : double 

2277 (D) TOPOLOGY: linear 
2278 

227 9 (ii) MOLECULE TYPE: CDNA to mRNA 

2280 

2281 (iii) HYPOTHETICAL: NO 

2282 

2283 (iii) ANTI-SENSE: NO 

2284 

2285 (vi) ORIGINAL SOURCE: 

2286 (A) ORGANISM: Oryza sativa 

2287 (B) STRAIN: Nipponbare, subsp. japonica 

2288 (D) DEVELOPMENTAL STAGE: etiolated shoot (8 days old) 
2289 

22 90 (ix) FEATURE: 

22 91 (A) NAME/KEY: CDS 

2292 (B) LOCATION: 3.. 252 

2293 

2294 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

2295 

22 96 TGTCCTGGAA GGTCCGCCTC GTGCAGGTTN CGACGACGGT GACGGTGTTC 
--> 22 97 GTCGTCGGGA 60 
2298 

22 99 GGAACGTCGA CCAGGGCGCC GCNGACGTCG TCGCCAGATG GCAAGACGTC 
--> 2300 GCGCCGAGCC 120 

2301 

23 02 TCCCTCCCGA GCTCACCATA CGGGTGATCG TNCGAGGGCA GCGCGCCACG 
--> 2303 TTCCAGTCGC 180 

2304 

2 3 05 TGTACCTCGG CTCGTGCGCC GACCTGGTGC CGACGATGAG CAGCATGTTC 
--> 2306 CCGGAGCTCG 240 
2307 

23 08 GGATGACGAT TG 252 
2309 
2310 




2575 


(2) INFORMATION FOR SEQ ID NO: 57; 


2576 






2577 


(i) 


SEQUENCE CHARACTERISTICS: 


2578 




(A) LENGTH: 1981 base pairs 


2579 




(B) TYPE: nucleic acid 


2580 




(C) STRANDEDNESS : double 


2581 




( D ) TOPOLOGY : 1 inear 


2582 






2583 


(ii) 


MOLECULE TYPE: cDNA to mRNA 


2584 






2585 


(iii) 


HYPOTHETICAL: NO 


2586 






2587 


(iii) 


ANTI- SENSE: NO 


2588 






2589 


(vi) 


ORIGINAL SOURCE: 
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2590 
2591 
2592 
2593 
2594 
2595 
2596 
2597 
2598 
2599 
2600 
2601 
2602 
2603 
2604 
2605 
2606 
2607 
2608 
2609 
2610 
2611 
2612 
2613 
2614 
2615 
2616 
2617 
2618 
2619 
2620 
2621 
2622 
2623 
2624 
2625 
2626 
2627 
2628 
2629 
2630 
2631 
2632 
2633 
2634 
2635 
2636 
2637 
2638 
2639 
2640 
2641 
2642 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/258,031 A 



(A) ORGANISM: Lactuca sativa 

(B) STRAIN: lollo bionda 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 7.. 1626 



DATE: 09/03/1999 
TIME: 14:51:46 



INPUT SET: S3321Lraw 



(ix) FEATURE: rp^ 

(A) NAME/KEY :/unsure 

(B) LOCATION^ replace (372, "g") 

(ix) FEATURE: 

(A) NAME/KEYy^nsure 

(B) L0CATI0e{: replace (379, "g") 




(ix) FEATURE: 

(A) NAME/KEYi^nsure 

(B) LOCATI01sk^eplace(786, "t") 

(ix) FEATURI 

(^^.^••''■'NAME/KEY : unsure 
^) LOCATION: replace ( 1105 1106 , 

(D) OTHER INFORMATION: /note= "also^pbssible "gg" and "aa"" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

ACAAAA ATG GCA ATT ACC TAT TCT TTC AAC TTC AAA TCT TAT ATT TTT 48 
Met Ala lie Thr Tyr Ser Phe Asn Phe Lys Ser Tyr lie Phe 
15 10 

CCT CTC CTC CTT GTC TTG CTC TCT ACC CAT TCA TCA GCG ACT TCA ACT 96 
Pro Leu Leu Leu Val Leu Leu Ser Thr His Ser Ser Ala Thr Ser Thr 
15 20 25 30 

TCC ATT ATA GAT CGC TTC ACC CAA TGT CTA AAC AAC CGA GCT GAC CCT 144 
Ser lie lie Asp Arg Phe Thr Gin Cys Leu Asn Asn Arg Ala Asp Pro 
35 40 45 

TCT TTC CCG CTC AGT GGA CAA CTT TAC ACT CCC GAT AAC TCC TCT TTT 192 
Ser Phe Pro Leu Ser Gly Gin Leu Tyr Thr Pro Asp Asn Ser Ser Phe 
50 55 60 

CCA TCC GTC TTG CAA GCT TAC ATC CGG AAC CTC CGA TTC AAT GAA TCC 240 
Pro Ser Val Leu Gin Ala Tyr lie Arg Asn Leu Arg Phe Asn Glu Ser 
65 70 75 

ACG ACT CCC AAA CCC ATC TTA ATC ATC ACC GCC TTA CAC CCT TCA CAC 288 
Thr Thr Pro Lys Pro lie Leu lie lie Thr Ala Leu His Pro Ser His 
80 85 90 

ATT CAA GCA GCT GTT GTG TGC GCC AAA ACA CAC CGC CTG CTA ATG AAA 336 
lie Gin Ala Ala Val Val Cys Ala Lys Thr His Arg Leu Leu Met Lys 
95 100 105 110 



PAGE: 36 RAW SEQUENCE LISTING DATE: 09/03/1999 

PATENT APPLICATION US/09/258,031 A TIME: 14:51:46 



INPUT SET: S33211,raw 

^^^^ 71 A, 

2644 ACC AGA AGC GGA GGC CAT GAT TAT GAG GGG CTT TOC/ TAT GTG gfcc AAT 3 84 

2645 Thr Arg Ser Gly Gly His Asp Tyr Glu Gly Leu Ser Tyr ❖ail Thr Asri 

2646 115 120 125 
2647 

2648 TCG AAC CAA CCC TTT TTT GTT GTT GAG ATG TTC AAC TTA CGC TCC ATA 432 
2 649 Ser Asn Gin Pro Phe Phe Val Val Asp Met Phe Asn Leu Arg Ser lie 
2650 130 135 140 

2651 

2652 AAC GTG AGT ATT GAA GAT GAA ACT GCA TGG GTC CAA GCT GGT GCG ACT 48 0 

2653 Asn Val Ser lie Glu Asp Glu Thr Ala Trp Val Gin Ala Gly Ala Thr 

2654 145 150 155 
2655 

2656 CTT GGT GAA GTC TAC TAG CGA ATA GCA GAG AAA AGC AAC AGT CAT GCT 528 

2657 Leu Gly Glu Val Tyr Tyr Arg lie Ala Glu Lys Ser Asn Ser His Ala 

2658 160 165 170 
2659 

2660 TTT CCG GCT GGC GTT TGC CCT ACT GTT GGA GTT GGT GGC CAT TTT AGT 576 

2661 Phe Pro Ala Gly Val Cys Pro Thr Val Gly Val Gly Gly His Phe Ser 

2662 175 180 185 190 
2663 

2664 GGT GGT GGT TAT GGT AAC TTG ATG GGA AAA TAC GGC CTT TCT GTT GAC 624 

2665 Gly Gly Gly Tyr Gly Asn Leu Met Gly Lys Tyr Gly Leu Ser Val Asp 

2666 195 200 205 
2667 

2668 AAT ATT GTC GAT GCT CAG TTA ATC GAT GTG AAT GGT AAA CTT CTG AAT 672 

2669 Asn lie Val Asp Ala Gin Leu lie Asp Val Asn Gly Lys Leu Leu Asn 

2670 210 215 220 
2671 

2672 CGG AAA TCA ATG GGT GAA GAT CTT TTT TGG GCC ATC AC A GGT GGT GGT 720 

2673 Arg Lys Ser Met Gly Glu Asp Leu Phe Trp Ala lie Thr Gly Gly Gly 

2674 225 230 235 
2675 

2676 GGT GTC AGC TTT GGT GTG GTT GTA GCG TAC AAG ATC AAA CTG GTT CGT 768 . 

2677 Gly Val Ser Phe Gly Val Val Val Ala Tyr Lys lie Lys Leu Val Arg 

2678 240 245 250 

2679 pi 

2680 GTT CCT ACC ACT GTG AC^GTT TTT AAC GTA CAA AGA ACA TCC GAG CAG 816 

2681 Val Pro Thr Thr Val Tfir Val Phe Asn Val Gin Arg Thr Ser Glu Gin 

2682 255 260 265 270 
2683 

2684 AAC CTA AGC ACC ATA GCC CAC CGA TGG ATA CAA GTT GCG GAT AAG CTC 864 

2685 Asn Leu Ser Thr lie Ala His Arg Trp lie Gin Val Ala Asp Lys Leu 

2686 275 280 285 
2687 

2688 GAT AAT GAC CTT TTC CTT CGA ATG ACC TTT AAC GTG ATA AAC AAC ACA 912 

2689 Asp Asn Asp Leu Phe Leu Arg Met Thr Phe Asn Val lie Asn Asn Thr 

2690 290 295 300 
2691 

2692 AAT GGC GAA AAG ACG ATA CGT GGT TTG TTT CCA ACA CTG TAC CTC GGA 960 

2693 Asn Gly Glu Lys Thr lie Arg Gly Leu Phe Pro Thr Leu Tyr Leu Gly 

2694 305 310 315 
2695 



PAGE: 37 RAW SEQUENCE LISTING DATE: 09/03/1999 

PATENT APPLICATION US/09/2S8,031A TIME: 14:51:47 

INPUT SET: S33211,raw 

2696 AAC TCT ACC GCT CTT GTT GCC CTC CTG AAC AAG GAT TTC CCT GAA TTA 1008 

2697 Asn Ser Thr Ala Leu Val Ala Leu Leu Asn Lys Asp Phe Pro Glu Leu 

2698 320 325 330 
2699 

2700 GGT GTA GAA ATT TCA GAT TGT ATT GAA ATG AGT TGG ATC GAG TCT GTT 1056 

2701 Gly Val Glu lie Ser Asp Cys lie Glu Met Ser Trp lie Glu Ser Val 

2702 335 340 345 350 
2703 

2704 
2705 

2706 CTT TTC TAC ACA AAC TTC CCC ATT GGT ACT CCG ACC ACT GCT CTT CTA 1104 

2707 Leu Phe Tyr Thr Asn Phe Pro lie Gly Thr Pro Thr Thr Ala Leu Leu 

2708 355 360 365 

2710 (l^ CGT ACA CCT CAA AGA CTA AAC CCA TTC AAA ATC AAA TCT GAT TAC 

2711 SC152 . . — ' 



2712 Ser Arg Thr Pro Gin Arg Leu Asn Pro Phe Lys lie Lys Ser Asp Tyr 

2713 370 375 380 
2714 

2715 GTA AAA AAC ACT ATT TCC AAA CAG GGA TTC GAA TCC ATA TTT GAA AGG 

2716 1200 

2 717 Val Lys Asn Thr lie Ser Lys Gin Gly Phe Glu Ser lie Phe Glu Arg 

2718 385 390 395 

2719 

2720 ATG AAA GAA CTC GAA AAC CAA ATG CTA GCT TTC AAC CCT TAT GGT GGA 

2721 1248 

2722 Met Lys Glu Leu Glu Asn Gin Met Leu Ala Phe Asn Pro Tyr Gly Gly 

2723 400 405 410 
2724 

2725 AGA ATG AGC GAA ATT TCC GAA TTT GCA AAG CCT TTT CCC CAT CGA TCA 1296 

2726 Arg Met Ser Glu lie Ser Glu Phe Ala Lys Pro Phe Pro His Arg Ser 

2727 415 420 425 430 
2728 

2729 GGG AAT ATA GCG AAG ATC CAA TAC GAA GTA AAC TGG GAT GAA CTT GGC — - 

2730 1344 

2731 Gly Asn lie Ala Lys lie Gin Tyr Glu Val Asn Trp Asp Glu Leu Gly 

2732 435 440 445 
2733 

2734 GTT GAA GCA GCC AAT CGG TAC TTG AAC TTC ACA AGG GTG ATG TAT GAT 1392 

2735 Val Glu Ala Ala Asn Arg Tyr Leu Asn Phe Thr Arg Val Met Tyr Asp 

2736 450. 455 460 
2737 

2738 TAT ATG ACT CCG TTT GTT TCT AAG AAC CCC AGG GAA GCA TTT CTG AAC 1440 
273 9 Tyr Met Thr Pro Phe Val Ser Lys Asn Pro Arg Glu Ala Phe Leu Asn 
2740 465 470 475 

2741 

2742 TAC AGG GAT TTA GAT ATT GGT GTC AAC AGT CAT GGC AAG AAT GCT TAC 1488 

2743 Tyr Arg Asp Leu Asp lie Gly Val Asn Ser His Gly Lys Asn Ala Tyr 

2744 480 485 490 
2745 

2 746 GGT GAA GGA ATG GTT TAT GGG CAC AAG TAT TTC AAA GAG ACG AAT TAT 

2747 1536 

2748 Gly Glu Gly Met Val Tyr Gly His Lys Tyr Phe Lys Glu Thr Asn Tyr 



& 



PAGE: 38 RAW SEQUENCE LISTING DATE: 09/03/1999 

PATENT APPLICATION US/09/258,031A TIME: 14:51:47 

INPUT SET: S33211.raw 
2749 495 500 505 510 

2750 

--> 2751 AAG AGG CTA ACG ATG GTG AAG ACG AGG GTT GAT CCT AGC AAT TTT TTT 1584 

2752 Lys Arg Leu Thr Met Val Lys Thr Arg Val Asp Pro Ser Asn Phe Phe 

2753 515 520 525 
2754 

2755 AGG AAT GAG CAA AGT ATC CCA ACT TTG TCA TCT TCA TGG AAG 1626 

2756 Arg Asn Glu Gin Ser lie Pro Thr Leu Ser Ser Ser Trp Lys 

2757 530 535 540 
2758 

2759 TAAATTCTAA ATTCACTTGT GAAATTGAAT AAAAGTATGG CTTTTTCAAG 
--> 2760 GTCATGGTAT 1686 
2761 

2762 CCAGATTCAG ATGATATTGA TATAATTTTG ACTTGTATTT ATACAAACAA 
--> 2763 AATTATATTA 1746 

2764 

2 765 TATTTTTCTG AATTTAGATT TTCCATTCTT TGGAAAAATA TACGAACATT 
--> 2 766 GATGTTGATA 1806 

2767 

2768 TTTTTAAGAA TTATAGATTT TGAACATTGT GAACAATGAA TAAACCGAGG 
--> 2769 ACTTCCCTTG 1866 
2770 

2771 GGTTTTTTTT ATAAGTATGT AATAGCATGT CTTTAATCAA GATAACCGAT 
--> 2772 CATTGGATGC 1926 

2773 

2 774 AATTTATTAT TATAAACCTT ATTTAAAAAA AAAAAAAAAA AAAAAAAAAA 
--> 2775 AAAAA 1981 
2776 
2777 




3081 (2) INFORMATION FOR SEQ ID NO: 70: 
3082 

3083 (i) SEQUENCE CHARACTERISTICS: 

--> 3084 (A) LENGTH: 1757 base pairs 

3085 (B) TYPE: nucleic acid 

3 086 (C) STRANDEDNESS : double 

3087 (D) TOPOLOGY: linear 
3088 

3089 (ii) MOLECULE TYPE: DNA (genomic) 
3090 

3 091 (iii) HYPOTHETICAL: NO 
3092 

3093 (iii) ANTI-SENSE: NO 
3094 

3 095 (vi) ORIGINAL SOURCE: 

3 096 (A) ORGANISM: Arabidopsis thaliana 

3 097 (B) STRAIN: Colombia 

3098 

3 099 (ix) FEATURE: 

3100 (A) NAME/KEY: CDS 

3101 (B) LOCATION: join(1..570, 801.. 1754) 
3102 

3103 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
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RAW SEQUENCE LISTMG 

PATENT APPLICATION US/09/258,031 A 



DATE: 09/03/1999 
TIME: 14:51:47 



INPUT SET: S33211.raw 

3104 

3105 ACT TCC CGT AGA AAC TCG GAG ACT TTC ACA CAA TGC CTA ACC TCA AAC 48 

3106 Thr Ser Arg Arg Asn Ser Glu Thr Phe Thr Gin Cys Leu Thr Ser Asn 

3107 15 10 15 
3108 

310 9 TCC GAC CCC AAA CAT CCC ATC TCC CCC GCT ATC TTC TTC TCC GGA AAT 96 

3110 Ser Asp Pro Lys His Pro lie Ser Pro Ala lie Phe Phe Ser Gly Asn 

3111 20 25 30 
3112 

3113 GGC TCC TAC TCC TCC GTA TTA CAA GCC AAC ATC CGT AAC CTC CGC TTC 144 

3114 Gly Ser Tyr Ser Ser Val Leu Gin Ala Asn lie Arg Asn Leu Arg Phe 

3115 35 40 45 
3116 

3117 . AAC ACC ACC TCA ACT CCG AAA CCC TTC CTC ATA ATC GCC GCA ACA CAT 192 

3118 Asn Thr Thr Ser Thr Pro Lys Pro Phe Leu lie lie Ala Ala Thr His 

3119 50 55 60 
3120 

3121 GAA TCC CAT GTG CAA GCC GCG ATT ACT TGC GGG AAA CGC CAC AAC CTT 240 

3122 Glu Ser His Val Gin Ala Ala lie Thr Cys Gly Lys Arg His Asn Leu 

3123 65 70 75 80 
3124 

3125 CAG ATG AAA ATC AGA AGT GGA GGC CAC GAC TAC GAT GGC TTG TCA TAC 

3126 288 

3127 Gin Met Lys lie Arg Ser Gly Gly His Asp Tyr Asp Gly Leu Ser Tyr 

3128 85 90 ^ 95 
3129 

--> 3130 GTT ACA TAC TCT GGC AAA CCG TTC TTC GTC CTC GAC ATG TTT AAC CTC 336 

3131 Val Thr Tyr Ser Gly Lys Pro Phe Phe Val Leu Asp Met Phe Asn Leu 

3132 100 105 110 
3133 

3134 . CGT TCG GTG GAT GTC GAT GTG GCA AGT AAG ACC GCG TGG GTC CAA ACC 3 84 

3135 Arg Ser Val Asp Val Asp Val Ala Ser Lys Thr Ala Trp Val Gin Thr 

3136 115 120 125 
3137 

313 8 GGT GCC ATA CTC GGA GAA GTT TAT TAC TAT ATA TGG GAG AAG AGC AAA 432 
313 9 Gly Ala lie Leu Gly Glu Val Tyr Tyr Tyr lie Trp Glu Lys Ser Lys 
3140 130 135 140 

3141 

3142 ACC CTA GCT TAT CCC GCC GGA ATT TGT CCC ACG GTT GGT GTC GGT GGC 480 

3143 Thr Leu Ala Tyr'^ Pro Ala Gly lie Cys Pro Thr Val Gly Val Gly Gly 

3144 145 150 155 160 
3145 

3146 CAT ATC AGT GGT GGA GGT TAC GGT AAC ATG ATG AGA AAA TAC GGT CTC 528 

3147 His lie Ser Gly Gly Gly Tyr Gly Asn Met Met Arg Lys Tyr Gly Leu 

3148 165 170 175 
3149 

3150 ACC GTA GAT AAT ACC ATC GAT GCA AGA ATG GTC GAC GTT AAT 570 

3151 Thr Val Asp Asn Thr lie Asp Ala Arg Met Val Asp Val Asn 

3152 180 185 190 
3153 

3154 GGTATAATTG ATATCTCTAT TTTATATACT AATTAAATTT TATAGTGTGG 
--> 3155 ATCGGATAGT 630 
3156 




PAGE: 40 RAW SEQUENCE LISTING DATE: 09/03/1999 

PATENT APPLICATION US/09/258,031A TIME: 14:51:48 

INPUT SET: S33211.raw 





3157 


GATTTTGGTC ' 


CATCAATTAA AAACTTGGTG AACATAAAAT 


TAACCAAGCA 






-> 


3158 


ATCAATTTAG 


690 




























3159 




































3160 


ACAAGCAACA ' 


TAATCATATA TATTTTTCTT ACATTTGTAT 


GTACCTGAAT 






-> 


3161 


ATTTATATTT 


750 




























3162 




































3163 


ATGTTTATAT ( 


GTTCTCACTA TATTTTCACT TTTGTATTTG 


AAAATTTTTA GGA AAA 




3164 


806 


































3165 




























Gly Lys 




3166 


































-> ' 


3167 


ATT 


TTG 


GAT 


AGA 


AAA 


TTG 


ATG 


GGA 


GAA 


GAT 


CTC 


TAC 


TGG 


GCA 


ATA 


AAC 




3168 


He 


Leu 


Asp 


Arg 


Lys 


Leu 


Met 


Gly 


Glu 


Asp 


Leu 


Tyr 


Trp 


Ala 


He 


Asn 




3169 






195 










200 










205 










3170 




































3171 


GGA 


GGA 


GGA 


GGA 


GGG 


AGC 


TAC 


GGC 


GTC 


GTA 


TTG 


GCC 


TAC 


AAA 


ATA 


AAC 




3172 


902 


































3173 


Gly 


Gly 


Gly Gly 


Gly 


Ser 


Tyr 


Gly 


Val 


Val 


Leu 


Ala 


Tyr 


Lys 


He 


Asn 




3174 




210 










215 










220 












3175 


































- > 


3176 


CTT 


GTT 


GAA 


GTC 


CCA 


GAA 


AAC 


GTC 


ACC 


GTT 


TTC 


AGA 


ATC 


TCC 


CGG 


ACG 




3177 


Leu 


Val 


Glu 


Val 


Pro 


Glu 


Asn 


Val 


Thr 


Val 


Phe 


Arg 


He 


Ser 


Arg 


Thr 




3178 


225 










230 










235 










240 




3179 




































3180 


TTA 


GAA 


CAA 


AAT 


GCG 


ACG 


GAT 


ATC 


ATT 


CAC 


CGG 


TGG 


CAA 


CAA 


GTT 


GCA 




3181 


Leu 


Glu 


Gin 


Asn 


Ala 


Thr 


Asp 


He 


He 


His 


Arg 


Trp 


Gin 


Gin 


Val 


Ala 




3182 










245 










250 










255 






3183 




































3184 


CCG 


AAG 


CTT 


CCC 


GAC 


GAG 


CTT 


TTC 


ATA 


AGA 


ACA 


GTC 


ATT 


GAC 


GTA 


GTA 




3185 


Pro 


Lys 


Leu 


Pro 


Asp 


Glu 


Leu 


Phe 


He 


Arg 


Thr 


Val 


He 


Asp 


Val 


Val 




3186 








260 










265 










270 








3187 




































3188 




































3189 


AAC 


GGC 


ACT 


GTT 


TCA 


TCT 


CAA 


AAG 


ACC 


GTC 


AGG 


ACA 


ACA 


TTC 


ATA 


GCA 




3190 


1094 
































3191 


Asn 


Gly 


Thr 


Val 


Ser 


Ser 


Gin 


Lys 


Thr 


Val 


Arg 


Thr 


Thr 


Phe 


He 


Ala 




3192 






275 










280 










285 










3193 


































- > 


3194 


ATG 


TTT 


CTA 


GGA 


GAC 


ACG 


ACA 


ACT 


CTA 


CTG 


TCG 


ATA 


TTA 


AAC 


CGG 


AGA 




3195 


Met 


Phe 


Leu 


Gly 


Asp 


Thr 


Thr 


Thr 


Leu 


Leu 


Ser 


He 


Leu 


Asn 


Arg 


Arg 




3196 




290 










295 










300 












3197 




































3198 


TTC 


CCA 


GAA 


TTG 


GGT 


TTG 


GTC 


CGG 


TCT 


GAC 


TGT 


ACC 


GAA 


ACA 


AGC 


TGG 




3199 


Phe 


Pro 


Glu 


Leu 


Gly 


Leu 


Val 


Arg 


Ser 


Asp 


Cys 


Thr 


Glu 


Thr 


Ser 


Trp 




3200 


305 










310 










315 










320 




3201 




































3202 


ATC 


CAA 


TCT 


GTG 


CTA 


TTC 


TGG 


ACA 


AAT 


ATC 


CAA 


GTT 


GGT 


TCG 


TCG 


GAG 




3203 


He 


Gin 


Ser 


Val 


Leu 


Phe 


Trp 


Thr 


Asn 


He 


Gin 


Val 


Gly 


Ser 


Ser 


Glu 




3204 










325 










330 










335 






3205 




































3206 


ACA 


CTT 


CTA 


CTC 


CAA 


AGG 


AAT 


CAA 


CCC 


GTG 


AAC 


TAC 


CTC 


AAG 


AGG 


AAA 




3207 


1286 
































3208 


Thr 


Leu 


Leu 


Leu 


Gin 


Arg 


Asn 


Gin 


Pro 


Val 


Asn 


Tyr 


Leu 


Lys 


Arg 


Lys 




3209 








340 










345 










350 







854 



950 



998 
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3210 
--> 3211 

3212 
3213 
3214 
3215 
3216 
3217 
3218 
3219 
3220 
3221 
3222 
3223 
3224 
3225 
3226 
3227 
3228 
3229 
3230 
3231 
3232 
3233 
3234 
--> 3235 
3236 
3237 
3238 
3239 
3240 
3241 
3242 
3243 
3244 
3245 
3246 
3247 
3248 
3249 
3250 
3251 
3252 
3253 
3254 
3255 
3256 
3257 
3258 
3259 



RAW SEQUENCE LISTING DATE: 09/03/1999 

PATENT APPLICATION US/09/258,031A TIME: 14:51:48 

INPUT SET: S33211.raw 

TCA GAT TAG GTA CGT GAA CCG ATT TCA AGA ACC GGT TTA GAG TCA ATT 1334 

Ser Asp Tyr Val Arg Glu Pro lie Ser Arg Thr Gly Leu Glu Ser lie 
355 360 365 

TGG AAG AAA ATG ATC GAG CTT GAA ATT CCG ACA ATG GCT TTC AAT CCA 
1382 

Trp Lys Lys Met lie Glu Leu Glu lie Pro Thr Met Ala Phe Asn Pro 

370 375 380 

TAC GGT GGT GAG ATG GGG AGG ATA TCA TTA CGG GTG ACT CCG TTC CCA 
1430 

Tyr Gly Gly Glu Met Gly Arg lie Ser Leu Arg Val Thr Pro Phe Pro 

385 390 395 400 

TAC AGA GCC GGT AAT CTC TGG AAG ATT CAG TAC GGT GCG AAT TGG AGA 
1478 

Tyr Arg Ala Gly Asn Leu Trp Lys lie Gin Tyr Gly Ala Asn Trp Arg 
405 410 415 

GAT GAG ACT TTA ACC GAC CGG TAC ATG GAA TTG ACG AGG AAG TTG TAC 
1526 

Asp Glu Thr Leu Thr Asp Arg Tyr Met Glu Leu Thr Arg Lys Leu Tyr 
420 425 430 

CAA TTC ATG ACA CCA TTT GTT TCC AAG AAT CCG AGA CAA TCG TTT TTC 1574 

Gin Phe Met Thr Pro Phe Val Ser Lys Asn Pro Arg Gin Ser Phe Phe 
435 440 445 

AAT AAC CGT GAT GTT GAT TTG GGT ATT AAT TCT CAT AAT GGT AAA ATC 1622 
Asn Asn Arg Asp Val Asp Leu Gly lie Asn Ser His Asn Gly Lys lie 
450 455 460 

AGT AGT TAT GTG GAA GGT AAA CGT TAC GGG AAG AAG TAT TTC GCA GGT 
1670 

Ser Ser Tyr Val Glu Gly Lys Arg Tyr Gly Lys Lys Tyr Phe Ala Gly 
465 470 475 480 

AAT TTC GAG AGA TTG GTG AAG ATT AAG ACG AGA GTT GAT AGT GGT AAT 
1718 

Asn Phe Glu Arg Leu Val Lys lie Lys Thr Arg Val Asp Ser Gly Asn 
485 490 495 



TTC TTT AGG AAC GAA CAC AGT ATT CCT GTG TTA CCA TAA 1757 

Phe Phe Arg Asn Glu His Ser lie Pro Val Leu Pro 
500 505 



3368 (2) INFORMATION FOR SEQ ID NO: 72: 
3369 



PAGE: 42 RAW SEQUENCE LISTING DATE: 09/03/1999 

PATENT APPLICATION US/09/258,031A TIME: 14:51:48 

INPUT SET: S33211.raw 

3 37 0 (i) SEQUENCE CHARACTERISTICS: 

--> 3371 (A) LENGTH: 1527 base pairs 

3372 (B) TYPE: nucleic acid 

3373 (C) STRANDEDNESS : double 
3 374 (D) TOPOLOGY: linear 
3375 

3 3 76 (ii) MOLECULE TYPE: CDNA to mRNA 

3377 

3 378 (iii) HYPOTHETICAL: NO 

3379 

3380 (iii) ANTI-SENSE: NO 

3381 

3 3 82 (vi) ORIGINAL SOURCE: 

3 383 (A) ORGANISM: Arabidopsis thaliana 

3384 (B) STRAIN: Colombia 

3385 

3386 (ix) FEATURE: 

3387 (A) NAME/KEY: CDS 

3388 (B) LOCATION: 1 . . 1524 
3389 

3 3 90 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

3391 

33 92 ACT TCC CGT AGA AAC TCG GAG ACT TTC ACA CAA TGC CTA ACC TCA AAC 48 

3393 Thr Ser Arg Arg Asn Ser Glu Thr Phe Thr Gin Cys Leu Thr Ser Asn 

3394 15 10 15 
3395 

33 96 TCC GAC CCC AAA CAT CCC ATC TCC CCC GCT ATC TTC TTC TCC GGA AAT 96 
33 97 Ser Asp Pro Lys His Pro lie Ser Pro Ala lie Phe Phe Ser Gly Asn 
3398 20 25 30 

3399 

3400 GGC TCC TAC TCC TCC GTA TTA CAA GCC AAC ATC CGT AAC CTC CGC TTC 144 

3401 Gly Ser Tyr Ser Ser Val Leu Gin Ala Asn lie Arg Asn Leu Arg Phe 

3402 35 40 45 
3403 

3404 AAC ACC ACC TCA ACT CCG AAA CCC TTC CTC ATA ATC GCC GCA ACA CAT 192 

3405 Asn Thr Thr Ser Thr Pro Lys Pro Phe Leu lie lie Ala Ala Thr His 

3406 50 55 60 
3407 

3408 GAA TCC CAT GTG CAA GCC GCG ATT ACT TGC GGG AAA CGC CAC AAC CTT 240 

3409 Glu Ser His Val Gin Ala Ala lie Thr Cys Gly Lys Arg His Asn Leu 

3410 65 70 75 80 
3411 

3412 CAG ATG AAA ATC AGA AGT GGA GGC CAC GAC TAC GAT GGC TTG TCA TAC 

3413 288 

3414 Gin Met Lys lie Arg Ser Gly Gly His Asp Tyr Asp Gly Leu Ser Tyr 

3415 85 90 95 
3416 

--> 3417 GTT ACA TAC TCT GGC AAA CCG TTC TTC GTC CTC GAC ATG TTT AAC CTC 336 

3418 Val Thr Tyr Ser Gly Lys Pro Phe Phe Val Leu Asp Met Phe Asn Leu 

3419 100 105 110 
3420 

3421 CGT TCG GTG GAT GTC GAC GTG GCA AGT AAG ACC GCG TGG GTC CAA ACC 3 84 

3422 Arg Ser Val Asp Val Asp Val Ala Ser Lys Thr Ala Trp Val Gin Thr 




.2^ 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/258,031A 



DATE: 09/03/1999 
TIME: 14:51:49 




INPUT SET: S33211.raw 

3423 115 120 125 

3424 

3425 GGT GCC ATA CTC GGA GAA GTT TAT TAG TAT ATA TGG GAG AAG AGO AAA 432 

3426 Gly Ala lie Leu Gly Glu Val Tyr Tyr Tyr lie Trp Glu Lys Ser Lys 

3427 130 135 140 
3428 

342 9 ACC CTA GCT TAT CCC GCC GGA ATT TGT CCC ACG GTT GGT GTC GGT GGC 480 

343 0 Thr Leu Ala Tyr Pro Ala Gly lie Cys Pro Thr Val Gly Val Gly Gly 
3431 145 150 155 160 
3432 

343 3 CAT ATC AGT GGT GGA GGT TAC GGT AAC ATG ATG AGA AAA TAG GGT CTC 528 

3434 His lie Ser Gly Gly Gly Tyr Gly Asn Met Met Arg Lys Tyr Gly Leu 

3435 165 170 175 
3436 

343 7 ACC GTA GAT AAT ACC ATC GAT GCA AGA ATG GTC GAC GTA AAT GGA AAA 
3438 576 

343 9 Thr Val Asp Asn Thr lie Asp Ala Arg Met Val Asp Val Asn Gly Lys 
3440 180 185 190 

3441 

-> 3442 ATT TTG GAT AGA AAA TTG ATG GGA GAA GAT CTC TAG TGG GCA ATA AAC 624 

3443 lie Leu Asp Arg Lys Leu Met Gly Glu Asp Leu Tyr Trp Ala lie Asn 

3444 195 200 205 
3445 

3446 GGA GGA GGA GGA GGG AGC TAC GGC GTC GTA TTG GCC TAC AAA ATA AAC 

3447 672 

3448 Gly Gly Gly Gly Gly Ser Tyr Gly Val Val Leu Ala Tyr Lys lie Asn 

3449 210 215 220 
3450 

-> 3451 CTT GTT GAA GTC CCA GAA AAC GTC ACC GTT TTC AGA ATC TCC CGG ACG 720 

3452 Leu Val Glu Val Pro Glu Asn Val Thr Val Phe Arg lie Ser Arg Thr, 

3453 225 230 235 240 
3454 

3455 TTA GAA CAA AAT GCG ACG GAT ATC ATT CAC CGG TGG CAA CAA GTT GCA 768 

3456 Leu Glu Gin Asn Ala Thr Asp lie lie His Arg Trp Gin Gin Val Ala 

3457 245 250 255 
3458 

3459 CCG AAG CTT CCC GAC GAG CTT TTC ATA AGA ACA GTC ATT GAC GTA GTA 816 

3460 Pro Lys Leu Pro Asp Glu Leu Phe lie Arg Thr Val lie Asp Val Val 

3461 260 265 270 
3462 

3463 AAC GGC ACT GTT TCA TCT CAA AAG ACC GTC AGG ACA ACA TTC ATA GCA 864 

3464 Asn Gly Thr Val Ser Ser Gin Lys Thr Val Arg Thr Thr Phe lie Ala 

3465 275 280 285 
3466 

3467 ATG TTT CTA GGA GAC ACG ACA ACT CTA CTG TCG ATA TTA AAC CGG AGA 912 

3468 Met Phe Leu Gly Asp Thr Thr Thr Leu Leu Ser lie Leu Asn Arg Arg 

3469 290 295 300 
3470 

3471 TTC CCA GAA TTG GGT TTG GTC CGG TCT GAC TGT ACC GAA ACA AGC TGG 960 

3472 Phe Pro Glu Leu Gly Leu Val Arg Ser Asp Cys Thr Glu Thr Ser Trp 

3473 305 310 315 320 
3474 

3475 ATC CAA TCT GTG CTA TTC TGG ACA AAT ATC CAA GTT GGT TCG TCG GAG 1008 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/258,031 A 



DATE: 09/03/1999 
TIME: 14:51:49 



3476 
3477 
3478 
3479 
3480 
3481 
3482 
3483 
3484 
3485 
3486 
3487 
3488 
3489 
3490 
3491 
3492 
3493 
3494 
3495 
3496 
3497 
3498 
3499 
3500 
3501 
3502 
3503 
3504 
3505 
3506 
3507 
3508 
3509 
3510 
3511 
3512 
3513 
3514 
3515 
3516 
3517 
3518 
3519 
3520 
3521 
3522 
3523 
3524 
3525 
3526 
3527 
3528 



INPUT SET: S33211.raw 
lie Gin Ser Val Leu Phe Trp Thr Asn lie Gin Val Gly Ser Ser Glu 
325 330 335 



ACA CTT 
1056 
Thr Leu 



TCA GAT 

Ser Asp 



TGG AAG 
1152 
Trp Lys 
370 

TAG GGT 

Tyr Gly 
385 

TAG AGA 
1248 
Tyr Arg 



GAT GAG 
1296 
Asp Glu 



OTA CTC CAA AGG AAT CAA CCC GTG AAC TAG CTC AAG AGG AAA 

Leu Leu Gin Arg Asn Gin Pro Val Asn Tyr Leu Lys Arg Lys 
340 . 345 350 



TAG GTA CGT GAA CCG ATT TCA AGA ACC GGT TTA GAG TCA ATT 1104 

Tyr Val Arg Glu Pro lie Ser Arg Thr Gly Leu Glu Ser lie 
355 360 365 



AAA ATG ATC GAG CTT GAA ATT CCG ACA ATG GGT TTC AAT CCA 

Lys Met lie Glu Leu Glu lie Pro Thr Met Ala Phe Asn Pro 
375 380 



Ala Gly Asn Leu Trp Lys lie Gin Tyr Gly Ala Asn Trp Arg 
405 410 415 

ACT TTA ACC GAG CGG TAG ATG GAA TTG AGG AGG AAG TTG TAG 

Thr Leu Thr Asp Arg Tyr Met Glu Leu Thr Arg Lys Leu Tyr 

420 425 430 



CAA TTC 

Gin Phe 



AAT TAG 
Asn Tyr 
450 

AGT AGT 
1440 
Ser Ser 
465 

AAT TTC 
1488 
Asn Phe 



TTC TTT 

Phe Phe 



TAT GTG GAA GGT AAA CGT TAG GGG AAG AAG TAT TTC GCA GGT 

Tyr Val Glu Gly Lys Arg Tyr Gly Lys Lys Tyr Phe Ala Gly 
470 475 480 

GAG AGA TTG GTG AAG ATT AAG ACG AGA GTT GAT AGT GGT AAT 

Glu Arg Leu Val Lys lie Lys Thr Arg Val Asp Ser Gly Asn 
485 490 495 

AGG AAC GAA CAG AGT ATT CCT GTG TTA CCA TAA 

Arg Asn Glu Gin Ser lie Pro Val Leu Pro 

500 505 




GGT GAG ATG GGG AGG ATA TCA TCT ACG GTG ACT CCG TTC CCA 1200 

Gly Glu Met Gly Arg lie Ser Ser Thr Val Thr Pro Phe Pro 
390 395 400 

GCC GGT AAT CTC TGG AAG ATT CAG TAG GGT GCG AAT TGG AGA 




ATG ACA CCA TTT GTT TCC AAG AAT CCG AGA CAA TCG TTT TTC 1344 

Met Thr Pro Phe Val Ser Lys Asn Pro Arg Gin Ser Phe Phe 
435 440 445 

CGT GAT GTT GAT TTG GGT ATT AAT TCT CAT AAT GGT AAA ATC 13 92 
Arg Asp Val Asp Leu Gly lie Asn Ser His Asn Gly Lys lie 
455 460 



1527 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/258,031 A 



DATE: 09/03/1999 
TIME: 14:51:50 



INPUT SET: S33211.raw 



3529 




3 63 9 (2) INFORMATION FOR SEQ ID NO : 74: 
3640 

3 641 (i) SEQUENCE CHARACTERISTICS: 

--> 3642 (A) LENGTH: 1530 base pairs 

3643 (B) TYPE: nucleic acid 

3644 (C) STRANDEDNESS: double 

3645 (D) TOPOLOGY: linear 
3646 

3 647 (ii) MOLECULE TYPE: CDNA to mRNA 

3648 

3649 (iii) HYPOTHETICAL: NO 

3650 

3651 (iii) ANTI-SENSE: NO 

3652 

3653 (vi) ORIGINAL SOURCE: 

3654 (A) ORGANISM: Arabidopsis thaliana 

3655 (B) STRAIN: Colombia 
3656 

3 657 (ix) FEATURE: 

3 658 (A) NAME/KEY: CDS 

3659 (B) LOCATION: 1..1527 

3660 

3661 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

3662 

3 663 TCC ATC CAA GAT CAA TTC ATA AAC TGT GTC AAA AGA AAC ACA CAT GTT 4 8 

3664 Ser lie Gin Asp Gin Phe lie Asn Cys Val Lys Arg Asn Thr His Val 

3665 15 10 15 
3666 

3667 TCT TTT CCA CTC GAG AAA ACG TTA TTC ACC CCT GCG AAA AAC GTC TCT 96 

3668 Ser Phe Pro Leu Glu Lys Thr Leu Phe Thr Pro Ala Lys Asn Val Ser 

3669 20 25 30 
3670 

3671 

3672 TTG TTC AAC CAA GTC CTT GAA TCG ACG GCT CAA AAT CTC CAG TTC TTG 144 

3673 Leu Phe Asn Gin Val Leu Glu Ser Thr Ala Gin Asn Leu Gin Phe Leu 

3674 35 40 45 
3675 

3 676 GCA AAA TCC ATG CCT AAA CCG GGA TTC ATA TTC AGA CCG ATT CAC CAG 192 

3677 Ala Lys Ser Met Pro Lys Pro Gly Phe lie Phe Arg Pro lie His Gin 

3678 50 55 60 
3679 

3680 TCT CAA GTC CAA GCT TCC ATC ATT TGT TCA AAG AAA CTC GGA ATT CAT 240 

3681 Ser Gin Val Gin Ala Ser lie lie Cys Ser Lys Lys Leu Gly lie His 

3682 65 70 '75 80 
3683 

3 684 TTT CGT GTT AGA AGT GGC GGT CAC GAT TTC GAG GCC TTG TCT TAT GTT 288 

3685 Phe Arg Val Arg Ser Gly Gly His Asp Phe Glu Ala Leu Ser Tyr Val 

3686 85 90 95 
3687 

3688 TCA CGG ATT GAA AAA CCG TTT ATA TTA CTC GAC CTG TCA AAA TTG AAA 3 36 

3689 Ser Arg lie Glu Lys Pro Phe lie Leu Leu Asp Leu Ser Lys Leu Lys 
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3690 100 105 110 

3691 

36 92 CAA ATC AAT GTT GAT ATT GAA TCC AAT AGT GCT TGG GTT CAA CCT GGT 384 

3693 Gin lie Asn Val Asp lie Glu Ser Asn Ser Ala Trp Val Gin Pro Gly 

3694 115 120 125 
3695 

3 696 GCT ACG CTT GGT GAG CTT TAG TAG AGA ATT GCA GAG AAG AGO AAG ATC 432 

3697 Ala Thr Leu Gly Glu Leu Tyr Tyr Arg lie Ala Glu Lys Ser Lys lie 

3698 130 135 140 
3699 

3700 CAT GGA TTT CCC GCG GGT TTG TGC ACA AGT GTA GGC ATA GGT GGG TAT 480 

3701 His Gly Phe Pro Ala Gly Leu Cys Thr Ser Val Gly lie Gly Gly Tyr 

3702 145 150 155 160 
3703 

3704 ATG ACA GGC GGT GGA TAC GGT ACC TTG ATG AGG AAG TAT GGT CTT GCG 528 

3705 Met Thr Gly Gly Gly Tyr Gly Thr Leu Met Arg Lys Tyr Gly Leu Ala 

3706 165 170 175 
3707 

3 708 GGA GAT AAT GTT CTA GAC GTA AAG ATG GTT GAT GCA AAT GGT AAA TTA 576 

3709 Gly Asp Asn Val Leu Asp Val Lys Met Val Asp Ala Asn Gly Lys Leu 

3710 180 185 190 
3711 

3 712 CTC GAC AGA GCC GCG ATG GGT GAG GAC CTA TTT TGG GCG ATT AGA GGA 

3713 624 

3714 Leu Asp Arg Ala Ala Met Gly Glu Asp Leu Phe Trp Ala lie Arg Gly 

3715 195 200 205 
3716 

--> 3717 GGC GGT GGA GCG AGT TTC GGG ATA GTT CTA GCA TGG AAG ATC AAG CTT 672 

3718 Gly Gly Gly Ala Ser Phe Gly lie Val Leu Ala Trp Lys lie Lys Leu 

3719 210 215 220 
3720 

3 721 GTT CCT GTT CCT AAG ACT GTT ACC GTC TTC ACT GTC ACC AAA ACG TTA 72 0 

3722 Val Pro Val Pro Lys Thr Val Thr Val Phe Thr Val Thr Lys Thr Leu 

3723 225 230 235 240 
3724 

3 725 GAA CAA GAC GCA AGA TTG AAG ACT ATT TCT AAG TGG CAA CAA ATT TCA 768 

3726 Glu Gin Asp Ala Arg Leu Lys Thr lie Ser Lys Trp Gin Gin lie Ser 

3727 245 250 255 
3728 
3729 
3730 

3 731 TCC AAG ATT ATT GAA GAG ATA CAC ATC CGA GTG GTA CTC AGA GCA GCT 816 

3732 Ser Lys lie He Glu Glu He His He Arg Val Val Leu Arg Ala Ala 

3733 260 265 270 
3734 

373 5 GGA AAT GAT GGA AAC AAG ACT GTG ACA ATG ACC TAC CTA GGT CAG TTT 864 
3 73 6 Gly Asn Asp Gly Asn Lys Thr Val Thr Met Thr Tyr Leu Gly Gin Phe 
3737 275 280 285 

3738 

373 9 CTT GGC GAG AAA GGC ACC TTG CTG AAG GTT ATG GAG AAG GCT TTT CCA 912 
3 740 Leu Gly Glu Lys Gly Thr Leu Leu Lys Val Met Glu Lys Ala Phe Pro 
3741 290 295 300 

3742 
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3743 


GAA 


CTA 


GGG 


TTA 


ACT 


CAA 


AAG 


GAT 


TGT 


ACT 


GAA 


ATG 


AGC 


TGG 


ATT 


GAA 


3744 


Glu 


Leu 


Gly Leu 


Thr 


Gin 


Lys 


Asp 


Cys 


Thr 


Glu 


Met 


Ser 


Trp 


He 


Glu 


3745 


305 










310 










315 










320 


3746 


































3747 


GCC 


GCC 


CTT 


TTC 


CAT 


GGT 


GGA 


TTT 


CCA 


ACA 


GGT 


TCT 


CCT 


ATT 


GAA 


ATT 


3748 


Ala 


Ala 


Leu 


Phe 


His 


Gly Gly 


Phe 


Pro 


Thr 


Gly 


Ser 


Pro 


He 


Glu 


He 


3749 










325 










330 










335 




3750 


































3751 


TTG 


CTT 


CAG 


CTC 


AAG 


TCG 


CCT 


CTA 


GGA 


AAA 


GAT 


TAC 


TTC 


AAA 


GCA 


ACG 


3752 


Leu 


Leu 


Gin 


Leu 


Lys 


Ser 


Pro 


Leu 


Gly 


Lys 


Asp 


Tyr 


Phe 


Lys 


Ala 


Thr 


3753 








340 










345 










350 






3754 


































3755 


TCG 


GAT 


TTC 


GTT 


AAA 


GAA 


CCT 


ATT 


CCT 


GTG 


ATA 


GGC 


TTC 


AAA 


GGA 


ATA 


3756 


Ser 


Asp 


Phe 


Val 


Lys 


Glu 


Pro 


He 


Pro 


Val 


He 


Gly 


Phe 


Lys 


Gly 


He 


3757 






355 










360 










365 








3758 


































3759 


TTC 


AAA 


AGA 


TTG 


ATT 


GAA 


GGA 


AAC 


ACA 


ACA 


TTT 


CTG 


AAC 


TGG 


ACT 


CCT 


3760 


Phe 


Lys 


Arg 


Leu 


He 


Glu Gly 


Asn 


Thr 


Thr 


Phe 


Leu 


Asn 


Trp 


Thr 


Pro 


3761 




370 










375 










380 










3762 


































3763 


TAG 


GGT 


GGT 


ATG 


ATG 


TCG 


AAA 


ATC 


CCT 


GAA 


TCT 


GCG 


ATC 


CCA 


TTT 


CCG 


3764 


Tyr 


Gly Gly Met 


Met 


Ser 


Lys 


He 


Pro 


Glu 


Ser 


Ala 


He 


Pro 


Phe 


Pro 


3765 


385 










390 










395 










400 


3766 


































3767 


CAT 


AGA 


AAC 


GGA 


ACC 


CTC 


TTC 


AAG 


ATT 


CTC 


TAT 


TAC 


GCG 


AAC 


TGG 


CTA 


3768 


His 


Arg Asn Gly 


Thr 


Leu 


Phe 


Lys 


He 


Leu 


Tyr 


Tyr 


Ala 


Asn 


Trp 


Leu 


3769 










405 










410 










415 




3770 


































3771 


GAG 


AAT 


GAC 


AAG 


ACA 


TCG 


AGT 


AGA 


AAA 


ATC 


AAC 


TGG 


ATC 


AAA 


GAG 


ATA 


3772 


1296 






























3773 


Glu 


Asn Asp 


Lys 


Thr 


Ser 


Ser 


Arg 


Lys 


He 


Asn 


Trp 


He 


Lys 


Glu 


He 


3774 








420 










425 










430 






3775 


































3776 


TAG 


AAT 


TAG 


ATG 


GCG 


CCT 


TAT 


GTC 


TCA 


AGC 


AAT 


CCA 


AGA 


CAA 


GCA 


TAT 


3777 


Tyr 


Asn 


Tyr 


Met 


Ala 


Pro 


Tyr 


Val 


Ser 


Ser 


Asn 


Pro 


Arg 


Gin 


Ala 


Tyr 


3778 






435 










440 










445 








3779 


































3780 


GTG 


AAC 


TAC 


AGA 


GAT 


CTA 


GAC 


TTC 


GGA 


CAG 


AAC 


AAG 


AAC 


AAC 


GCA 


AAG 


3781 


1392 






























3782 


val 


Asn 


Tyr 


Arg 


Asp 


Leu 


Asp 


Phe 


Gly 


Gin 


Asn 


Lys 


Asn 


Asn 


Ala 


Lys 


3783 




450 










455 










460 










3784 


GTT 


AAC 


TTC 


ATT 


GAA 


GCT 


AAA 


ATC 


TGG 


GGA 


CCT 


AAG 


TAC 


TTC 


AAA 


GGC 


3785 


Val 


Asn 


Phe 


lie 


Glu 


Ala 


Lys 


He 


Trp 


Gly 


Pro 


Lys 


Tyr 


Phe 


Lys 


Gly 


3786 


465 










470 










475 










480 


3787 


































3788 


AAT 


TTT 


GAC 


AGA 


TTG 


GTG 


AAG 


ATT 


AAA 


ACC 


AAG 


GTT 


GAT 


CCA 


GAG 


AAC 


3789 


1488 






























3790 


Asn 


Phe 


Asp 


Arg 


Leu 


Val 


Lys 


He 


Lys 


Thr 


Lys 


Val 


Asp 


Pro 


Glu 


Asn 


3791 










485 










490 










495 




3792 


































3793 


TTC 


TTC 


AGG 


CAC 


GAG 


CAG 


AGT 


ATC 


CCA 


CCT 


ATG 


CCC 


TAC 


TAG 






3794 


Phe 


Phe 


Arg 


His 


Glu 


Gin 


Ser 


He 


Pro 


Pro 


Met 


Pro 


Tyr 








3795 








500 










505 
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960 



1008 



1200 




1440 




1530 



